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RESPONSE TO PAPER NO. 20 

In response to the Office action mailed March 27, 1995, the 
time for response to which is extended with the accompanying 
transmittal letter inlcuding a petition, reconsideration and 
reexamination are respectfully requested. 

One rejection stands against claim 13 of this application. 
Claim 13 is rejected under 3 5 U.S.C. § 112, first paragraph, for 
allegedly failing to provide an adequate written description of 
the invention and allegedly failing to provide an enabling 
disclosure. This rejection is respectfully traversed. 

Applicants response separates this rejection into the two 
issues of first fulfilling the written description requirement 
and second fulfilling the enablement requirement. 
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Applicants' specification literally sets forth the nucleic 
acid claimed. As noted by the Examiner, the last line of page 
12 of the specification indicates that the claimed nucleic acid 
is specifically disclosed. Paper No. 20 at page 2. To find a 
literal disclosure of the subject matter claimed, one could read 
the sequence on Figs. 11 and 12. With this literal support, it 
is not understood how there can be a question as to the 
specifications' fulfillment of the written description 
requirement . 

The purpose of the written description of § 112 has been 

clearly stated by the Federal Circuit: 

The purpose of the "written description" 
requirement is broader than to merely explain 
how to "make and use"; the applicant must also 
convey with reasonable clarity to those skilled 
in the art that, as of the filing date sought, 
he or she was in possession of the invention. 
The invention is, for purposes of the "written 
description" inquiry, whatever is now claimed. 
(emphasis in original) 

Vas-Cath Inc. vs. Mahurkar , 935 F.2d 1555, 1563-1564, 19 

U.S. P. Q. 2d 1111, 1117 (Fed Cir. 1991). In this case, by 

specifically setting out in the specification the nucleic 

acid as it is now claimed, one of ordinary skill in the art 

would certainly have known that applicants' were in 

possession of the claimed invention. Therefore, applicants 

submit that they have met the written description requirement 

of 35 U.S.C. § 112, first paragraph. 

As noted by the Examiner (Paper No. 2 0 at pages 2-3), 

the specification also sets forth specific uses for the 
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claimed invention. Other uses, such as in cloning HIV-1 DNA 
and related DNA, would be readily apparent to one of skill in 
the art . 

The Examiner asserts, however, that the specification 
does not set forth how to make and use the invention for the 
asserted purposes. Paper No. 20 at page 3. The recited 
sequence could have been made by chemical synthesis or 
recombinant DNA methods using applicants' disclosure. No 
barriers exist, and none have been asserted, to making the 
claimed invention . 

With respect to the use of the claimed invention, 
applicants note that "use" has been clearly defined by the 
Federal Circuit in a case such as this. "Usefulness in 
patent law, and in particular in the context of 
pharmaceutical inventions, necessarily includes the 
expectation of further research and development." In re 
Brana, 51 F.3d 1560, 1568, 34 U.S.P.Q.2d 1437, 1442 (Fed. 
Cir. 1995) . Therefore, applicants need not demonstrate that 
the claimed invention can be used to produce a marketable or 
finished diagnostic product, vaccine, or therapeutic. 

Furthermore, the Examiner has supplied no reasons why 
the claimed invention could not be used to clone HIV-1 
sequences or as a hybridization probe. Since only one use 
suffices and each of these uses would be readily apparent to 
one of skill in the art, this rejection, inter alia, is in 
error. 
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The Examiner suggests that the known and routine methods 
for expressing an open reading frame would not apply to the 
claimed invention. Brown et al . is cited as alleged evidence 
for what "must be considered for successful expression of a 
given gene sequence." Paper No. 20 at page 3. 

However, the skilled artisan was often faced with the 
routine task of expressing an open reading frame. Exhibits 1 
and 2, attached hereto, show general methods for identifying 
and expressing open reading frames and localizing important 
immunoreactive regions of polypeptides expressed from the 
open reading frames. Gray et al . , Exhibit 1, shows vectors 
and methods for expressing open reading frames known to one 
of skill in the art at the time of filing. Nunberg et al . , 
Exhibit 2, shows methods for expressing and determining 
antigenic peptides from an open reading frame. In addition, 
as commercial enterprises were, at the time of filing, 
producing and selling vectors for expressing and analyzing 
open reading frames, applicants submit that the Examiner's 
assertions are incorrect. One did not need specific 
instructions on how to express the sequence as claimed. 

The Examiner also mentions the glycosylation of the 
expressed polypeptide. Paper No. 20 at page 3. Since this 
plainly is the type of "further research and development" 
discussed in In re Brana (see above) , these arguments do not 
appear relevant to the patentable use of the claimed 
invention. The arguments addressing useful vaccines and 
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diagnostics at page 4 of Paper No. 20 also encompass "further 
research and development" that applicants need not show for 
the claimed invention. 

With respect to the use of the claimed invention as a 
hybridization probe, the Examiner asserts "no specific 
conditions or methods are given which would allow distinction 
between HIV-1 and other retroviruses." Paper No. 2 0 at page 
5. However, why such a demonstration need be shown is not 
understood. 

One of skill in the art was clearly familiar with 
hybridization methods and assays as well as with the possible 
factors that can be varied. Well known also was the formula 
for determining the melting temperature of a "hybrid." See 
the Maniatis excerpt (Maniatis et al . Molecular Cloning, 
1982, pp. 324-5, 388-9), attached as Exhibit 3, showing the 
routine considerations for hybridization assays known to one 
of skill in the art. See also the Wu excerpt (Wu et al. 
eds., Methods in Enzymology, 1983, vol. 100, pp. 266-285), 
attached as Exhibit 4. Applicants set forth the entire HIV-1 
DNA sequence with the G+C content available from that 
sequence. As described in both Maniatis and Wu, one could 
have routinely calculated the melting temperature, thus the 
hybridization conditions, for assaying a sample with HIV-1 
DNA with this information. No further guidance is needed. 

Applicants contend that one of skill in the art could 
have easily made a probe that only detects HIV-1 from the 
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present disclosure. However, even if the alleged cross- 
hybridization to other HTLV viruses the Examiner mentions 
(page 5 of Paper No. 20) did occur, under the patent laws, 
this would not prevent the use of the claimed invention. A 
probe that identifies all the members of the HTLV family can 
be a useful probe for that purpose. 

In fact, the Examiner's contentions regarding the cross- 
hybridization cannot be properly based upon the contents of 
the Arya et al. and Hahn et al . documents. Both Arya et al., 
at page 928 and the legend of Fig. 2, and Hahn et al., at 
page 168 and the legend of Fig. 4., express the desire to 
show "homology" and "relatedness" between the members of the 
HTLV family of viruses. Hahn et al. explicitly states "low 
stringency conditions" in the legend of Fig. 4, which one of 
skill in the art knows would foster cross-hybridization. 

However, the Examiner uses Arya et al . and Hahn et al . 
to allegedly demonstrate that they show something beyond 
their contents, i.e. that a hybridization assay 
discriminating between the members of the HTLV family is not 
possible with the claimed invention. This is especially 
inappropriate in a case such as this where the objective of 
both Arya et al . and Hahn et al. are obviously different from 
the use the Examiner is analyzing for the claimed invention. 

For these reasons, applicants claimed invention is 
enabled by the specification and a prima facie case of lack 
of enablement has not been set forth. Applicants 



respectfully request withdrawal of this objection and 
rejection. 

If there are any other fees due in connection with the 
filing of this Response, please charge such fees to our 
Deposit Account No. 06-0916. If a fee is required for an 
Extension of Time under 3 7 C.F.R. § 1.13 6 not accounted for 
above, such an Extension is requested and the fee should also 
be charged to our Deposit Account. 



Respectfully submitted, 



FINNEGAN, HENDERSON, FARABOW, 
GARRETT & DUNNER 



By: 




David J. Kullk_ 
Reg. No. 36,576 



Date: September 25, 1995 
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Open reading frame cloning: Identification, cloning, and expression 
of open reading frame DNA 

(loci gene fusion/frameshift mutant/hybrid protein) 
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ABSTRACT A plasmid was constructed that facilitates the 
cloning and expression of open reading frame DNA. A DNA frag- 
ment containing a bacterial promoter and the amino terminus of 
the cl gene of bacteriophage A was fused to an amino-terminally 
deleted version of the loci gene. An appropriate cloning site was 
inserted between these two fragments such that a frameshift mu- 
tation was introduced upstream of the lacZ-encoding DNA. This 
cloning vehicle produces a relatively low level of 0-galactosidase 
activity when introduced into Escherichia coll The insertion of 
foreign DNA at the cloning site can reverse the frameshift mu- 
tation and generate plasmids that produce a relatively high level 
of 0-galactosidase activity. A large fraction of these plasmids pro- 
duce a fusion protein that has a portion of the A cl protein at the 
amino terminus, the foreign protein segment in the middle, and 
the lacZ polypeptide at the carboxyl terminus. The production of 
a high level of 0-galactosidase and a large fusion polypeptide guar- 
antees the cloning of a DNA fragment with at least one open read- 
ing frame that traverses the entirety of the fragment. Hence, the 
method can identify, clone, and express (as part of a larger fusion 
polypeptide) open reading frame DNA from among a large col- 
lection of DNA fragments. 

The use of recombinant DNA technology has increased our 
understanding of the organization and expression of eukaryotic 
DNA. While many goals can be achieved with existing tech- 
nology, certain features of eukaryotic gene organization are still 
attainable only with difficulty. In particular, the identification 
of protein-coding segments within a large block of eukaryotic 
DNA is quite challenging. One should be able to take advantage 
of properties that are unique or highly enriched in coding DNA. 
Perhaps the most ubiquitous and specific feature of coding DNA 
is the presence of open reading frames. Indeed, analysis at the 
DNA sequence level has proved invaluable in identifying genes 
or gene segments. Although there are many examples of genes 
in which relatively small exons are interrupted by relatively 
large and numerous introns, this characteristic appears to be 
relatively infrequent in lower eukaryotes. Also, higher eukary- 
otic genes have been analyzed that contain open reading frames 
of substantial length. 

In this report, we describe a method for cloning pieces of 
DNA based on this property. We have exploited the elegant 
work of Beckwith and co-workers (1), who have demonstrated 
the power and utility of gene and operon fusions to the Esch- 
erichia coli 0-galactosidase gene (the lacZ gene). Many of these 
studies rely on the fact that the enzyme, /3-galactosidase (0-D- 
galactoside galactohydrolase, EC 3.2.1.23) is often biologically 
active when it carries other protein segments at its amino ter- 
minus. Indeed, it appears that in yeast, as well as in £. coli, 

The publication costs of this article were defrayed in part by page charge 
payment. This article must therefore be herebv market, "advertise- 
ment" in accordance with 18 U. S. C. §1734 solel'v to indicate this fact 



galactosidase can carry a wide range of protein sequences at its 
amino terminus and still retain substantial biological activity 
(2, 3). We have taken advantage of this property to create a lacZ 
vector for the cloning of open reading frames. We have intro- 
duced a frameshift mutation by inserting a DNA linker at the 
junction of a clkcZ fused gene. As expected, this frameshift 
results in a relatively low level of lacZ activity. The DNA linker 
encodes a cloning site. Consequently, the vector is able to 
screen for segments of open reading frame DNA, since a con- 
tinuous open reading frame sequence is a necessary condition 
for high-level expression of the lacZ sequence downstream from 
the insert. A successful construction results in the synthesis of 
a lacZ fusion polypeptide that includes the protein segment 
coded for by the inserted open reading frame. The vector is able 
to identify, clone, and express open reading frame DNA in one 
step. 

METHODS AND MATERIALS 

Materials. Restriction enzymes, T4 DNA ligase, and BAL- 
31 nuclease were purchased from New England BioLabs and 
Bethesda Research Laboratories. Calf intestine phosphatase 
was purchased from Boehringer- Mannheim; 5-bromo-4-chlo- 
ro-3-indolyl /3-D-galactoside (XGal), o-nitrophenvl 0-D-galac- 
topyranoside (ONPG), and isopropyl )3-D-thiogaIactoside 
were purchased from Sigma. XGal plates and MacConkey agar 
plates (Difco) were prepared as described by Miller (4). Sma 
l/BamHl adapter was purchased from New England BioLabs. 

Antibodies. Rabbit anri-0-gaJactosidase was a gift of Beth 
Rasmussen, rabbit anti-cl was a gift of Jeffrey Roberts, and the 

I-labeled IgG fraction of a goat anti-rabbit immunoglobulin 
Fc serum was a gift of Susan Lowey and Joan Press. 

Bacterial Strains. The lac deletion strain LG90 (F~ Mac pro 
XIII) was used exclusively (2). 

Plasmid DNA Preparation. Plasmid DNA was prepared by 
standard CsCl/ethidium bromide centrifugation methods (5) or 
from minilysates prepared by using the alkaline lysis procedure 
(6). 

Plasmid Constructions. Plasmid DNA was digested with 
commercial restriction enzymes as recommended by the sup- 
pliers. Where indicated, restricted DNA was treated with suf- 
ficient phosphatase [10 mM Tris-HCl (pH 8.0), 30 min at 37°C] 
to reduce self-ligation of a comparable amount of vector DNA 
by a factor of at least 1/100. DNA was sonicated in 5 ml of TE 
buffer (10 mM Tris-HCl, pH 8.3/1 mM EDTA)/0.2 M NaCl 
with six 30-sec bursts at maximum power, concentrated on a 
0.2-ml column of DEAE-cellulose, eluted with TE buffer/1 M 
NaCl, and precipitated with ethanol. DNA was resuspended in 
TE buffer, digested lightly with BAL-31 nuclease (sufficient to 

Abbreviations: XGal, 5-bromo-4-chloro-3-indolyl 0-r>galacto$ide; bp, 
base pair(s). 
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remove approximately 25 nucleotides from each end of a similar 
quantity of control restriction enzyme-digested DNA frag- 
ments) at 30°C, and fractionated by size on 10% acrvlamtde/ 
Tris borate/EDTA (TBE) gels along with appropriate molecular 
weight markers (7). 

Ligations. Cohesive-end ligations were carried out as follows: 
100 ng of plasmid vector and various amounts of eukaryotic 
DNA were ligated for 3 hr at 22°C in a final volume of 100 fil 
with 0.1 (Bethesda Research Laboratories) unit of T4 DNA li- 
gase. Blunt-end ligations were typically carried out as follows: 
500 ng of plasmid vector and various amounts of potential insert 
DNA were ligated overnight at 16°C in a final volume of 10 /xl 
with 1 unit of T4 DNA Iigase. 

Transformation. Bacterial transformation into LG90 was car- 
ried out by standard procedures (8), with the following modi- 
fications. All ligated DNA samples were phenol extracted and 
ethanol precipitated prior to transformation, which caused a 
reproducible 5- to 10-fold increase in transformation efficiency. 
Transformed cells (1 ml) were spread on the appropriate indi- 
cator plates containing ampicillin at 25 fJLg/ml Colonies were 
scored for phenotype on MacConkey agar plates after 24 hr at 
37°C. If the plates were crowded, the colony phenotype was 
scored at 48 hr. 

Hybrid Protein Analysis. Three-milliliter samples of cells 
from a saturated culture (grown in L broth containing ampicillin 
at 50 fig/ ml) were centrifuged, the pellet was suspended in 150 
Ml of 1.2-fold concentrated sample buffer (9) and heated at 
100°C for 3 min, and the viscosity was reduced by repeated 
passages through a 22-gauge syringe needle. Between 4 and 8 
Ml of the protein samples were electrophoresed in 7.5% acryl- 
amide gels (9). Protein blotting was carried out according to 
published procedures (10). 

0-Galactosidase Assays. These were carried out according 
to Miller (4) except that cells were grown in L broth. 

RESULTS 

Our initial hypothesis was that a frameshift mutation placed near 
the amino terminus of an open reading frame sequence that 
contains the lacZ sequence at its carboxyl terminus would pro- 
duce a plasmid that would be phenotypically lacZ" when trans- 
formed into E. coll Furthermore, the insertion of foreign DNA 
at or near the frameshift could correct the frameshift mutation 
and generate a lacT insert DNA-containing plasmid. The two 
constructions shown in Fig. 1A were designed to test this hy- 
pothesis. A segment of the cl gene from bacteriophage A ter- 
minating at a HindlU site at codon 157 was fused to a laclZ frag- 
ment. The cl gene fragment also contains two lac promoters that 
are oriented toward cl . Both constructions were identical except 
that two different /flcZ-containing plasmids were used. These 
two plasmids, pLG200 and pLC400, differ in DNA sequence 
only in the immediate vicinity of the Hindlll site (12). Because 
the DNA sequence of the A cl gene is also known (13), as well 
as the reading frame of the lacI-lacZ (laclZ) fusion protein en- 
coded by both pLC200 and pLC400 (12), we could predict the 
reading frame of the two resultant plasmids. Plasmid pMRl has 
24 bases between the lysine codon at amino acid 157 in the cl 
gene and the leucine codon at the beginning of the loci fragment 
(Fig. 1A). Since none of these eight codons are stop codons, this 
construction reads in frame from the A cl gene into the laclZ 
fusion and consequently produces a high level of 0-galactosidase 
activity. In contrast, pMR2, constructed from pLG200, has 10 
bases between the AAC lysine codon of the A cl gene and the 
CTG leucine codon of lad . Because this number of intervening 
bases is not divisible by 3, this construction- should shift the 
frame, resulting in improper reading of or termination in (or 
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AAGCC AAGOTT G CGOTCCC GACCTfrG<3CTC (in frame) 

WirtH TTT / O \ *"7qc"^ 1 



Hind IT (8am) 

*£l leu 
AAGCCAAGOTTGGCTC 

HindEE lacl" 



lacz 1 



pMR2 
(out of frame) 
lacZ" 




(out of frame) 
lacZ" 



Xcl HindHT Bom 8 Qm - (eu 

AAGCCAftG QPTQGGAff C CCOOSGGRTCCGGRG CTTGG CTG 



Smo 



loci 



? G ; /h J*"?"^ 011 of vectors - w > Plasmids pKB252 (11), pLG200 
and pLG400 (12) were manipulated by standard procedures to produce 

pKB252 waa ***** 

with Hat ^m/Hir^m and the largest fragment was purified. pLG200 
(or pLG400) was digested with Sma l/Hindm and the largest fragment 
was purified. These two fragments were ligated together and ampi- 
cUlin-resistant colonies were selected. The restriction sites of the re- 
sultantplaamids were verified. pMRl was derived by using pLG400 
and pMR2 was derived by using pLG200. Both plasmids, when trans- 
formed into LG90, form blue colonies on XGal plates. Thus, lac* and 
lac refer to phenotypes on MacConkey agar plates. The DNA sequence 
at thejoint is inferred from the sequence of the A cl gene up to the 
c lr T?^ n ? ) ^ d the ^eace in the vicinity of the Hindm site 
of PLG200 and P LG400 (12). (B) One hundred nanograms of BamHL 
digested pMRl and 2 ng of BamHl/Sma I adapter were ligated (final 
vol, 100 mI) with 0.1 unit of ligase for 3 hr at room temperature. DNAs 
were transformed into LG90, and ampicillin-resistant lac" colonies 
were scored on MacConkey agar plates. DNA sequences in the vicinity 
ot the cloning sites were verified by direct sequence analysis. 

both) the laclZ fusion sequence. This plasmid results in a rel- 
atively low level of 0-galactosidase activity. In addition, the in- 
sertion of eukaryotic DNA into the Hindlll site of pMR2 is able 
to generate colonies with a high level of lacZ activity (data not 
shown). 

To construct a vector of more utility, pMRl was manipulated 
as in Fig. IB. The construction inserted 10 base pairs (bp) at 
the BamHl site and therefore changed the reading frame so that 
the lacZ sequence is no longer properly translated. Moreover, 
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it has created a blunt-ended Sma I site, flanked by two BamHl 
sites, within this frameshifted region. The Sma I site provides 
a cloning site into which blunt-ended DNA can be cloned. In- 
sert DNA can be excised by digestion with BamHl 

To verify that this second-generation vector was suitable for 
cloning open reading frame DNA, Hae Ill-digested pBR322 was 
cloned into the Sma I site of pMRlOO. After transformation into 
LG90, 28 red colonies were chosen at random and analyzed for 
plasmid DNA insert size. All 28 clones yielded an additional 
BamHl fragment when analyzed by acrylamide gel electropho- 
resis; 8 of these clones are shown in Fig. 2. A very limited subset 
of the possible pBR322 Hae III fragments are cloned when am- 
picillin-resistant colonies are chosen on the basis of a strong 
lacZ + phenotype. As expected, most white colonies from this 
ligation contain plasmid DNA with no detectable insert, since 
most of these colonies are presumably derived from recycliza- 
tion of the vector (see Table 1). White colonies that harbor a 
plasmid with a DNA insert do not show the dramatic insert size 
preference seen with red colonies; DNA fragments of many 
different sizes are obtained from white colonies (data not 
shown). 

The sequence of pMRlOO predicts that two criteria must be 
met for a DNA fragment inserted into the Sma I site to generate 
a proper open reading frame between the A cl fragment and the 
laclZ fragment, (t) The inserted DNA must be of a size 3n - 
1 (where n is an integer) and (ti) the inserted DNA must contain 
no stop codons in the reading frame set by the frame of the A 
cl gene. Examination of the size and sequence (in both orien- 
tations) of the pBR322 Hae HI fragments predicts that, of the 
22 Hae III fragments, only 3 meet the two criteria: a 104-bp 
fragment from the tetracycline-resistance gene (which contains 
the single pBR322 BamHl site), an 89-bp fragment, and an 80- 
bp fragment, the latter two from the ampicillin-resistance gene. 
The 89-bp fragment has an appropriate open reading frame in 
both orientations, while the 104-bp fragment is cleaved by 
BamHl into two fragments, a 78- and a 26-bp fragment. Con- 
sequently, the sequence data predict that 4 fragment orienta- 
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Fig. 2. Cloning of Hae Ul fragments of pBR322 into pMRlOO. 
pBR322 DNA was digested to completion with Hae Ul and the resul- 
tant fragments digested with phosphatase (to prevent multiple in- 
serts). The phosphatase-digested DNA was ligated with pMRlOO and 
transformed into LG90. Red colonies were picked, and DNA was iso- 
lated from minilysates and digested with BamHl. DNA was analyzed 
on 10% acrylamide/Tris borate/EDTA gels. Lanes: 1 and 6, Hae III 
pBR322 fragment standards; 2-5 and 7-10, DNA from random red 
colonies. Arrows indicate the 104-bp and 89-bp Hae III pBR322 
fragments. 
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Table 1. Cloning of pBR322 Hae Ul fragments into pMRlOO 
and pMR200 



Ligation 




Vector/pBR322 


Colonies 


mixture 


Vector 


ratio* 


Red White 


1 


pMR200* 


1:1 


349 13 


2 


pMR200 


1:5 


300 65 


3 


pMRlOO 


1:5 


11 594 



Phosphatase-digested pBR322 Hae Ul fragments were ligated witl 
Sma I-digested pMRlOO or Sma I-digested pMR200. Ligated DNA waf 
transformed into LG90 and colonies were scored on MacConkey agai 
plates. Because pMR200 is a lacZ* vector having the Sma I cloning sit< 
at the same position as pMRlOO, the fraction of white colonies ** the 
fraction of religated vector that contains an insert. This value waf 
*20% in ligation mixture 2 but, in ligation mixture 3 under identicaJ 
conditions, 11/605 colonies are red. Normalizing for the expected frac- 
tion of colonies from this ligation that contain an insert ( 20% = same 
value as ligation mixture 2), the fraction of pBR322 Hae Ul fragments 
that can reverse the frameshift ~ (11/605) x 0.2 = 10%. 
* Arbitrary. 

f pMR200 was derived from pMRlOO as a rare lacZ* colony (on 
MacConkey agar) after digestion of pMRlOO with Sma I, self-liga- 
tion, and transformation. It is almost certainly a single-base-pair 
deletion of pMRlOO, placing the reading frame back in register. [The 
Sma I site (Fig. 1) is retained despite the deletion.] The size of the 
lacZ polypeptide is identical to that produced by pMRl and all re- 
striction sites in pMRlOO are still present. Since almost all inserts 
into the Sma I site are lacZ , the vector can be used to measure insert 
frequency. 

tions out of a possible 44 (22 fragments x two orientations: 
should produce red colonies. This value of 10% is consistent 
with experiment (Table 1). Moreover, there should be two size 
classes of insert fragments produced by BamHl digestion, 103 
and about 93 nucleotides (89- and 78- to 80-nucleotide frag- 
ments plus 14 nucleotides from the DNA between the Bam/ 
Sma sites shown in Fig. 1). Consistent with expectation, 26 of 
the 28 red colonies examined at random have one of the two size 
classes of DNA insert shown in Fig. 2. Moreover, double diges- 
tion of plasmids derived from these red colonies confirms that 
some of the larger size DNA inserts are indeed derived from 
the 89-bp Hae III fragment, which has been cloned in both 
orientations. 

The above data suggest that the vector pMRlOO can select 
suitable open reading frame fragments from among a larger 
group of DNA segments. Because of the nature of the cloning 
vehicle, a productive DNA insert should be expressed as a part 
of a larger fusion polypeptide. To verify this prediction, we 
cloned into pMRlOO sonicated DNA (carefully size fractionated 
to. avoid cloning small DNA fragments) derived from a cloned 
piece of Drosophila melanogaster DNA. Eleven red colonies 
were chosen for detailed analysis; the data are summarized in 
Table 2. Five of the 11 colonies contain proteins that are bigger 
(by approximately 20,000daltons) than the initial cl-lacIZ fusion 
protein, consistent with a DNA insert size of approximated 500 
bp (Table 2 and Fig. 3). These large proteins react with both 
anti-cl and anti-j3-galactosidase antisera (Table 2 and Fig. 4), 
consistent with tie hypothesis that they are proteins of the gen- 
eral structure cl-eukaryotic piece-lacIZ. These five strains con- 
tain additional smaller proteins, the largest of which is approx- 
imately 20,000 daltons less than the initial cl-lacZ fusion protein 
of pMR200. This protein reacts positively with anti-£-galacto- 
si<kse antiserum but does not react with anti-cl antiserum. A 
protein with similar properties is also visible in the strain with 
pMR200. Two of the colonies (nos-. 2 and 4) have properties 
suggesting that they are lacZ + revertants of pMRlOO generated 
by a deletion of perhaps 1 bp during cloning. Four of the col- 
onies (nos. 7, 9, 10, and 11) contain only the smaller proteins 
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Table 2. Insertion of cloned sonicated Drosophila DNA 
into pMRlOO 
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A Drosophila melanogaster clone, pDm2837, was a generous gift of 
Welcome Bender. This eukaryotic insert, an 8.1-kilobase (kb) Sal I 
fragment in the Sal I site of pBR322, has been identified on the basis 
of genetic criteria to include the ry* gene, which codes for xanthine 
dehydrogenase (Welcome Bender and Arthur Chovnick, personal com- 
munication). The 8.1-kb Sal I insert was purified by agarose gel elec- 
trophoresis and electroelution and then sonicated. DNAs of 400-550 
bp were selected for cloning, and 11 red colonies were chosen for sub- 
sequent analysis. No colonies or data were discarded from the analysis. 
0-Gal, /3-gaIactosidase; — , no detectable insert or protein band; ND, 
not done. 

* Protein size refers to the prominent 0*galactosidase polypeptide^) 
visible on NaDodS0 4 gels as assayed by Coomassie blue staining and 
blotting with anti-0-galactosidase antibody. 

'"Big" refers to a 0-galactosidase polypeptide 15,000-20,000 daltons 
larger than the cl-lacIZ fusion in pMR200. "Small" refers to a 0-ga- 
lactosidase polypeptide 15,000-20,000 daltons smaller than the 
cl-lacIZ fusion in pMR200; none of these "small" proteins reacts with 
anti-cl-antibody. 

(no, 7 is shown in Figs. 3 and 4). The properties of these poly- 
peptides are identical to those of the small protein of the other 
clones described above. The origin of this small protein is un- 
certain (see Discussion). 




Fig. 3. Gel analysis of hybrid proteins. Proteins from various 
sources were analyzed by NaDodS0 4 gel electrophoresis and the gel 
was stained with Coomassie blue. Lanes: 1, HB101; 2, strain HB101 
was grown overnight at 37°C in the presence of 1 mM isopropyl-S-D- 
thiogalactoside; 3, LG90; 4, pMRlOO; 5, pMR200; 6-11, clones 1 3, and 
5-8, respectively (see Table 2). Arrows: lower, wild-type lacZ (lane 2)- 
middle, cI-IacE fusion protein (lane 5); upper, large insert-derived fu- 
sion proteins (lanes 6-9 and 11). 

- 1) and entirely open reading frame ~(l/3) x (0.953) 1 = 55 
x 10' 4 for 400-bp DNA and 1. 1 x KT 4 for 500-bp DNA. The 
probability that a segment of DNA that is derived from a larger 
piece of open reading frame DNA is of the correct length and 
entirely open reading frame = the probability that it is the cor- 
rect length (1/3) x the probability that it is in the correct frame 
(1/3) x the probability that it has cloned in the correct orien- 
tation (1/2) = 5.5 x 10-* The difference between these num- 
bers suggests that an open reading frame clone from a 400- to 
500-bp DNA insert can be considered with some confidence to 
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DISCUSSION 

The experiments presented here validate our initial hypothesis: 
a plasmid vector having a frameshift near the amino terminus 
of a gene that codes for a polypeptide with lacZ* activity can 
be used to identify, clone, and express open reading frame seg- 
ments of DNA. Although other methods are available to carrv 
out these functions, they usually require detailed information, 
often at the DNA sequence level, about the gene or genes of 
interest. The value of the vector and approach described here 
is that no information need be known to clone and express frag- 
ments of open reading frame DNA from among a larger number 
of fragments. 

The data presented have been chosen from a large number 
of similar experiments to illustrate some important features of 
the methodology. When shotgun-cloning random fragments of 
DNA, it is important to size the DNA carefully prior to ligation. 
Because the frequency of random open reading frames increases 
in inverse proportion to DNA fragment size, the selection of red 
colonies will also select for small DNA inserts, even if the av- 
erage DNA size is quite large (data not shown). In the absence 
of any information about the DNA to be cloned in this way, we 
have chosen a DNA insert size of 400-500 bp. The probability 
that a random piece of DNA of length x is the correct size (3n 



Fig. 4. Blot analysis of hybrid proteins. Proteins from various 
sources were analyzed by protein blotting and antibody staining with 
anto-cl antibody (A) and anti-0-galactosidase antibody (£). Lanes: 1 
pMRlOO; 2, pMR200; 3-9, clones 1 and 3-8, respectively (see Table 2)! 
The arrow indicates the small protein discussed in the text. The ad- 
ditional smaller protein bands present in all lanes (except lane 1) are 
probably fusion protein degradation products that lack discrete por- 
tions from the carboxyl terminus, since these bands are visible with 
anti-cl antibody. 
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be a bona fide open reading frame in the DNA from which it 
was derived. This level of confidence decreases considerably 
as a function of insert size. Such an argument suggests that genes 
with small exons and large introns are not generally approach- 
able in this way although sonicated double-stranded cDNA 
could be cloned in a similar fashion. 

The data in Table 1 argue that the strategy enriches signifi- 
cantly for open reading frame DNA segments. On the other 
hand, some false positives are certainly generated, as shown in 
Table 2 and Figs. 3 and 4. Some red colonies contain plasmid 
DNA with no detectable DNA insert and a lacZ polypeptide of 
a molecular weight identical to the initial cl-lacIZ fusion pro- 
tein. As indicated above, these colonies are probably due to a 
small amount of exonuclease activity present during Sma I 
digestion or ligation. This source of red colonies is more prob- 
lematic when the fraction of bona fide open reading frame col- 
onies is low [e.g., when cloning genomic DNA from higher eu- 
karyotes (unpublished experiments)]. There are also red col- 
onies with proper DNA inserts but no detectable large fusion 
proteins. We have generated colonies of this phenotype from 
many sources of eulcaryotic DNA (data not shown), they are 
always associated with the presence of a small lacZ polypeptide 
of molecular weight similar or identical to the lacIZ-encoded 
portion of the fusion protein. Because a similar lacZ polypeptide 
is visible in protein from pMR200 and all of these small poly- 
peptides fail to react with anti-cl antibody, we favor the inter- 
pretation that they are due to proteolysis of the larger cl-pos- 
itive fusion protein. Presumably, this occurs to a variable extent 
with different fusion proteins. Alternatively, these small pro- 
teins from the insert DNA-containing plasmids may be due to 
translational restarts near the insert-/ac/Z junction, this matter 
might be clarified by determination of the DNA sequence of 
several inserts from plasmids of this type. 

Despite these two types of fusion proteins, a substantial frac- 
tion of red colonies contain bona fide open reading frame inserts 
as determined by the nature of the fusion protein they generate 
(Fig. 4). We expect this general approach to prove useful for a 
number of purposes. In addition to those described here, many 
of the fusion proteins generated in this or in a similar way may 
be antigenic and immunogenic. Large numbers of lac + colonies 



could be selected on lactose minimal plates and screened wit 
immunological reagents. Thus, it should be feasible to clon 
specific genes on this basis. The methodology should also serv 
as a rapid and convenient way to proceed in the opposite di 
rection [i.e., from a gene (or a subgene region) to an antibod 
reagent directed against a portion of the proteins coded for b 
the starting DNA]. 

We are grateful to the individuals who supplied reagents. Also w. 
thank Drs. Donald Winkelmann and L. E. Cannon for advice abou 
protein blotting and M. A. Osley and L. M. Hereford for helpful dis 
cussions. This work was supported by National Institutes of Healti 
Grant HD-08887 (to M.R.). M.R.G. was supported bv a predoctora 
Kessner Fellowship. 

Note Added in Proof. The large proteins from clones 5 and 6 react pos 
itively with antixanthine dehydrogenase antiserum (a gift of Arthu 
Chovnick), showing that the eulcaryotic protein pieces are antigenic. 
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ABSTRACT A method is presented whereby antigenic de- 
terminants recognized by specific monoclonal antibodies can 
be mapped to specific sites on a protein sequence with high 
resolution. Short DNase l-generated DNA fragments encoding 
portions of the protein of interest are molecularly cloned into 
the EcoRl site of the 0-galactosidase gene of phage \ Charon 
16 so as to obtain expression of random protein fragments as 
fusion proteins. The monoclonal antibody is used to screen the 
phage library to isolate phage expressing the specific antigenic 
determinant! DNA of immunoreactive phage can be analyzed 
rapidly and subcloned to allow DNA sequence determination. 
The method is generally applicable and permits antigenic de- 
terminants of functionally interesting monoclonal antibodies 
to be mapped and related to specific protein sequences. We 
have used this procedure to determine the region of the feline 
leukemia virus envelope protein gp70 recognized by a virus- 
neutralizing monoclonal antibody, cl.25. Antibody binding 
was mapped to a 14-amino acid region in the ammo-terminal 
half of gp70. This region may be directly involved in an essen- 
tial function of the gp70 protein, perhaps in gp70-mediated 
host recognition functions. Synthetic peptides derived from 
this region may provide useful vaccine antigens for the preven- 
tion of feline leukemia virus-associated disease in cats. 



Monoclonal antibodies {mAbs) have been invaluable for the 
understanding of structural and functional features .of viral 
proteins. Epitopes recognized by specific mAbs define 
structural and immunological domains of proteins and serve, 
operationally, as ur its of protein structure. For only a smalt 
number of viral proteins, however, has it been possible to 
relate operationally defined mAb epitopes to specific protein 
sequences. Current methods used to define sites of antibody 
binding often involve low-resolution mapping to specific 
chemical or proteolytic fragments of the protein (1-3). Se- 
quence analysis of naturally arising viral variants that differ 
antigenicaliy has been widely used to correlate protein se- 
quence and immunologic differences (4, 5). This method has 
recently been extended to include the analysis of viral vari- 
ants selected for resistance to specific virus-neutralizing 
mAbs; epitopes involved in virus neutralization have been 
mapped precisely by this procedure (6-8). The use of syn- 
thetic peptides to generate specific polyclonal antibodies has 
also permitted the identification of functionally interesting 
domains within proteins (9). 

In this paper, we present a generally applicable high-reso- 
lution method to map specific antigenic determinants recog- 
nized by mAbs. Specific antibodies are used to screen 
recombinant phage libraries expressing, as fusion proteins, 
short random fragments of sequences encoding the relevant 
protein. The use of antibody probes to detect expression of a 
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variety of specific antigens in recombinant libraries has been 
described by others ( 10-12). Recently. Young and Davis (13) 
have described a method for high-density screening of phage 
libraries containing genomic DNA from Saccharomyces 
cerevisiae. Polyclonal antibodies directed against two sub- 
units of RNA polymerase II were used to identify phage ex- 
pressing, as fusion proteins, antigens derived from the re- 
spective genes. Ruther et al. (14) have used polyclonal anti- 
bodies to isolate lysozyme-encoding sequences from a 
plasmid library containing small random fragments derived 
from molecularly cloned genomic sequences: the smallest 
such fragment identified encoded 10 amino acids of lyso- 
zyme. The method presented here uses high-density phage- 
screening procedures to identify, within proteins, small im- 
munogenic regions recognized by specific mAbs. 

We have used this method to localize a region of the feline 
leukemia virus (FeLV) envelope protein gp70 recognized by 
a specific virus- neutralizing mAb. The FeLVs comprise a 
group of horizontally transmitted retroviruses associated 
with a variety of naturally occurring malignant and degenera- 
tive diseases in domestic cats (15-17). Products of the viral 
envelope gene, specifically the major envelope glycoprotein 
gp70, are believed to be involved in recognition functions 
relating to virus host range (18, 19) and to serve as primary 
targets for antibody-mediated virus neutralization (20, 21). 
FeLV gp70 protein is found on the viral envelope in associa- 
tion with the hydrophobic anchor protein pl5E; both pro- 
teins are generated during virus maturation by cleavage of 
the gp85 precursor protein found on the surface of infected 
cells. 

In this paper, we report the mapping of the antigenic deter- 
minant recognized by a virus-neutralizing mAb to a region of 
14 amino acids within FeLV gp70. The ability to localize, on 
the amino acid sequence level, epitopes recognized by func- 
tionally interesting mAbs should prove useful in correlating 
protein structure and function. 

MATERIALS AND METHODS 

Monoclonal Antibodies. mAbs directed against FeLV gp70 
were provided by N. Pedersen (University of California, Da- 
vis). gp85 complex composed of cross-linked gp70 and pl5E 
was prepared as described by Schneider et al. (22) from FL- 
74 FeLV and used to immunize mice. Hybridomas were pre- 
pared as described by Lutz et al. (23). FL-74 FeLV contains 
a mixture of all FeLV subgroups (A. B, and C) (18). 

Virus neutralization was determined in a focus-induction 
assay using CCC clone 81 (S"L*) cells (24) or by virus-yield 
reduction using PG -411(E) <S*L~) cells provided by D. K. 



Abbreviations; FeLV, feline leukemia virus; GA-FeLV, Gardner- 
Amstein isolate of FeLV; MuLV. murine leukemia virus; mAb. 
monoclonal antibody; bp. base pairts): Ch.16. phage X Charon 16; 
M m and Vf n . mass and number, respectively, average molecular 
weight: X-Cal. 5-bromo-4-chloro-3-indolyl 0-D-galactoside. 
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Haapala (Frederick Cancer Research Institute): virus was 
preincubated with antibody (30 min at 37°C) and allowed to 
infect cells as described by Fischinger et al. (24). Virus yield 
was determined by p27 ELISA (23). All sera and antibody 
preparations were heated to inactivate complement. 

Construction of Phage Library. Moiecularly cloned FeLV- 
B [Gardner-Arnstein (GA) isolate] provirus (XHF60) (25) 
was provided by A. Roach and N. Davidson (California In- 
stitute of Technology) in the form of the pKC7-derived plas- 
mid pKHR-1. Short random segments of the GA-FeLV en- 
velope gene were used to construct a library in X phage Char- 
on 16 (Ch.16) (26); fragments were derived from the FeLV 
envelope gene bv digestion with DNase I in the presence of 
VinCI; (27). The Xho \, EcoRl DNA fragment of pKHR-1 
containing the envelope gene was incubated with DNase I 
(Worthington) (100 jig of DNA/ 10 ng of DNase I per ml) at 
25°C. and time points were analyzed by ethidium bromide 
staining of agarose gels to determine mass average molecular 
weight (A/J. M m values of 500-700 base pairs (bp) were ob- 
tained by 10-30 min digestion and correspond, in our experi- 
ence, to number average molecular weight (A? n ) values of 
approximately 100 bp as determined by analysis of end-la- 
beled fragments. Fragments were made blunt-ended by 
treatment with DNA polymerase I (Klenow fragment) in the 
presence of dNTPs and ligated to phosphorylated EcoRl 
linkers (5' G-G-A-A-T-T-C-C 3'; Collaborative Research. 
Waltham. MA) as described by Maniatis et al. (28). The liga- 
tion mixture was digested with Eo>Rl and fractionated on 
cellulose (CF-11: Whatman) as described by Franklin (29) 
to remove linker monomer, residual triphosphate, and other 
contaminants. An estimate of actual M n could be obtained at 
this point. 

The £coRI site in the 0-galactosidase gene of Ch.16 was 
used to construct an expression library of random envelope 
gene fragments. The library was constructed (30) and pack- 
aged in vitro (31), as described by Maniatis et al. (28). Phage 
vector DNA was dephosphorylated using bacterial alkaline 
phosphatase prior to ligation to maximize insertion frequen- 
cy. The resulting library was amplified prior to screening. 

Immunoscreening. Phage were grown in the LacZ* Esche- 
richia colt K-12 strain LE392 (28) or in the LacZ" strain 
MC1000 (32) on plates (26) containing the chromogenic indi- 
cator 5-bromo-4-chloro-3-indolyl 0-D-galactoside (X-Gal). 
After overnight growth, a dry nitrocellulose filter (Schlei- 
cher & Schuell. BA85) was gently placed on the plate and 
marked for orientation. Phage were allowed to adsorb for 
30-60 min and filters were screened much as described by 
Young and Davis (33). Filters were rinsed in 10 mM 
Tris*HCl, pH 7.4/150 mM NaCl/1 mM EDTA/0.19& Na- 
DodS0 4 for 30 min and then incubated in 10 mM Tris-HCI. 
pH 7.4/150 mM NaCl/1 mM EDTA/0.29& Triton X- 
100/0.1% NaDodS0 4 (buffer B), containing Wc ovalbumin 
( Sigma) for 60 min to block nonspecific protein binding sites. 
These and all subsequent steps were carried out at room 
temperature. Incubations were carried out in polyethylene 
bags containing 3.5 ml of solution per filter. After incuba- 
tion, the filters were allowed to react with virus-neutralizing 
mAb (50 jig of IgG/ml) in buffer B/Wc ovalbumin for 3 hr 
and washed three times (5 min each) in the same buffer prior 
to incubation with rabbit antimouse IgG antibody (10 m8 of 
IgG/ml) (Cappel Laboratories, Cochranville, PA) for 60 min. 
This second antibody had been adsorbed against £. colt pro- 
teins (13) to reduce nonspecific binding. Optimal antibody 
concentrations were determined to maximize the signal-to- 
noise ratio. After three more washes in the same buffer, 
bound antibody was detected by reaction with l23 l-labeled 
Staphylococcus aureus protein A (5 x 10 7 cpm/jig) for 60 
min (or overnight at 4°C) in buffer B/1% ovalbumin (5 x 10 6 
cpm per filter). Filters were finally washed four times (5 min 
each) in this buffer, dried, and autoradiographed. Reliable 



signals were obtained overnight. Immunoreactive chas 
were picked and purified by repeated screenings. 

Nucleic Acid Analysis of Immunoreactive Phage. DN A fro m 
immunoreactive phage was purified as described by Mania- 
tis et ai (28), The size and number of random envelope g enc 
fragments inserted within reactive phage DNA was deter- 
mined by digestion with EcoRl. £a;RI-digested DNA w as 
end-labeled using DNA polymerase I ( Klenow fragment! and 
analyzed by polyacrylamide gel electrophoresis to re-oKe 
inserted fragments. 

To determine the DNA sequence of inserted random env e - 
lope fragments, an Sst \/Pst I DNA fragment from immuno- 
reactive phage containing the EcoRl cloning site as ueli as 
flanking 0-galactosidase sequences (see Fig. 3) was sub- 
cloned into plasmid pUC13 (34). This fragment [*=3.5 kilo- 
base pairs plus insert(s)) was easily isolable by molecular 
cloning from Pst I- and Sst I-digested phage DNA by virtue 
of the unique Sst I site. Phage DNA digested with Pst I and 
Sst I was ligated into similarly digested pUC13. and "unite" 
ampicillin-resistant transformants were selected for restric- 
tion endonuclease analysis of plasmid DNA. Digestion with 
EcoRl proved informative in this analysis. DNA sequence 
analysis was by the procedures of Maxam and Gilbert 1 35 >; a 
restriction map of lac operon sequences within Ch.16 was 
determined from the previously published DNA sequence of 
the lacz and lacy genes (36. 37). Only a portion of the iac\ 
gene is contained within Ch.16. 

Other Procedures. Molecular cloning techniques used 
were as described in Maniatis et al. (28). Nucleic acid en- 
zymes were obtained from New England Biolabs or Bethes- 
da Research Laboratories and used according to the manu- 
facturers* instructions. Immunological blotting analysis of 
expressed proteins was carried out as described by Erlich et 
ai (38); proteins were resolved on NaDodSO^poiyacrylam- 
ide gels and transferred electrophoretically to cyanogen bro- 
mide-activated paper. mAb cl.25 binding was detected by 
using l23 Mabeled protein A (5. aureus) after reaction with 
rabbit anti-mouse IgG antibody. All buffers used in immuno- 
logic probing contained 0.1^ NaDodSO 4 /0.2^ Triton X-100 
as described above. Direct ELISA assays for antigen were 
carried out as described by Voller et al. (39); bacterial pro- 
teins were solubilized by boiling in buffer containing 29c Na- 
DodSO4/50 mM dithiothreitol prior to dilution into plate- 
coating buffer. Synthetic oligopeptides were synthesized us- 
ing a SAM II automated peptide synthesizer (Biosearch. San 
Rafael. CA). 

RESULTS 

Virus-Neutralizing mAb cl.25. Several mAbs directed 
against the envelope glycoprotein gp70 of FeLV were exam- 
ined for the ability to neutralize FeLV in a focus-induction 
assay using CCC ci.81 (S'L~> cells. One antibody, cl.25 
(IgG). was found to neutralize most FeLV isolates tested, of 
all FeLV subgroups. Viruses neutralized include CT600 
FeLV-A (40), CF-927 FeLV-A (41). GA-FeLV-B (25). Sny- 
der-Theilen FeLV-B (18). and CF-927 FeLV-C (41 1. CF-927 
FeLV-B (41) was not neutralized. We were interested in de- 
termining the site on FeLV gp70 involved in cl.25-mediated 
virus neutralization. 

Construction of a Random DNA Fragment Library Express- 
ing FeLV gp70 Determinants. Our approach to the identifica- 
tion of the antigenic determinant of FeLV gp70 recognized 
by the mAb cl.25 involved the construction of a phage li- 
brary containing random small fragments of the FeLV enve- 
lope gene so as to obtain expression of random envelope 
antigens as fusion products with 0-galactosidase. Phage ex- 
pressing the specific antigenic determinant recognized by 
cl.25 were detected by immunoscreening using the antibody. 

Having demonstrated mAb cl.25-mediated neutralization 
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of the Gardner-Arnstein <GA) isolate of FeLV-B. we con- 
structed a random DNA fragment library from envelope 
cene sequences of molecularly cloned GA-FeLV-B (25). A 
3 l-kilobase-pair Xho 1/EcoRi DNA fragment containing the 
entire 2.0 kildbase pairs of envelope-encoding sequences 
was isolated and dieested with DNase I to a mass average 
molecular weieht of 500-700 bp. In our experience, this cor- 
responds to a \/ n of - 100 bp. allowing for expression of. on 
the average. 33 amino acid segments of the FeLV envelope 
protein. 

Random DNA fragments were molecularly cloned into the 
EcoRl site encoding amino acids 1007- L008 (Giu-Phe) of the 
0-ealactosidase gene in Ch.I6. Approximately 30,000 inde- 
pendent plaques were derived; roughly 20^r contained in- 
serts as judged hv in situ plaque hybridization (42) using the 
onginal A7i7* I EcoRl GA-FeLV DNA fragment as probe 
(data not shown). From this library. -2000 independent 
phage were amplified and used in immunoscreening proce- 
dures. Using the simplest of assumptions.* a library ot this 
size would be expected to yield two independent isolates ex- 
pressing any specific determinant. 

Immunoscreening Using mAb cl.25. To assess our ability to 
detect expression in £. coti of a 0-galactosidase fusion pro- 
tein containine the antigenic determinant recognized by 
cl.25. a recombinant phage. Ch.l6:GA. was constructed in 
which sequences encoding the entire GA-FeLV gp85 precur- 
sor were fused, in translational reading frame, to the EcoRl 
site of Ch.16. Using sequences derived from a synthetic 
polvlinker (5' G-A-A-T-T-C-C-C-G-G 3 ). we adapted the 
Bat I site encoding the FeLV envelope signal peptide 
gp85 junction (43, 44) to create an EcoRi site and allow an m- 
frame fusion with the EcoRl site of /3-galactosidase. Un- 
earned phage from the in vitro packaging reaction were 
screened for expression of the cl.25 determinant as de- 
scribed and the results are shown in Fig. L4. Purified phage 
were shown, bv restriction endonuclease analysis, to contain 
the expected FeLV sequences in the expected orientation 
and were also shown to produce a fusion protein detectible 
by mAb cl.25 by immunological blotting analysis (Fig. 2). 

These experiments define conditions for successful 
screening and demonstrate the ability of the specific mAb to 
recognize the bacterially produced antigenic determinant. 
Antibodies directed against carbohydrate moieties of glyco- 
proteins or complex conformational determinants of native 
proteins would be expected to fail to recognize the bacterial 
fusion protein. 

Initial immunological screening of the amplified DNA 
fragment library was carried out at a plaque density of =*10 
per 85-mm plate. Of 11 plaques chosen for examination, all 
but one gave consistently reliable signals on repeated screen- 
ing and" purification. The purification of one such phage 
(Ch.l6:7) is shown in Fig. IB. Purified phage were homoge- 
neous as judged by immunoreactivity and X-Gal staining 
(faint blue on"*£. coli strain MC1000). 

Analysis of Immunoreactive Phage DNA. DNA was isolat- 
ed from immunoreactive phage and the structure was exam- 
ined by restriction endonuclease analysis. Phage DN As were 
digested with EcoRl to liberate the insert fragments), and 
fragments were labeled at the EcoRl ends by using DNA 
Polymerase 1 (Klenow fragment). Polyacrylamide gel analy- 
sis showed all immunoreactive isolates to be identical: all 
contained one 51-bp EcoRl insert fragment. This result is 
consistent with the limited complexity of the amplified li- 
brary. 

Immunologic analysis of proteins expressed by one such 

"Estimation of expected frequency is derived as follows: 0.20 (inser- 
tion frequency) x (100 bp iM« of insert*/ MOO bp (complexity ot 
DNA used to construct library )| * 0.5 (orientation* * 0.3?ireadmg 
frame). 




Fig. I. Immunological screening of phage libraries for detection 
of expression of the antigenic determinant recognized by mAb el. 25. 
[A ) Phage generated in the construction of Ch.l6:GA *ere screened 
using the mAb. Ch.t6:GA contains sequences of the mature FeLV 
gp85 fused, in frame, to /3-galactosidase. (fl> Purification of an im- 
munoreactive phage <Ch.i6:7) from a library containing random 
FeLV envelope gene fragments. After the initial identification < B-l ). 
phage were purified by repealed immunological greening (B-2» to 
final homogeneity (B-3>. Purified phage *cre homogeneous b> im- 
munological screening and by X-Gal staining. Filter orientation 
markers are indicated by crosses. 

immunoreactive phage (Ch. 16:9-3) showed the presence of a 
fusion protein recognized by both mAb cl.25 and rabbit anti- 
j3-galactosidase antibody (Fig. 2). These results confirm the 
immunologic specificity of the screening procedure. The fu- 
sion protein migrated slightly slower on NaDodS0 4 /poly- 
acryiamide gel electrophoresis than did the wild-type Ch.16 
/3-galactosidase. 

To facilitate DNA sequence analysis of the 50-bp fragment 
involved in immunoreactivity. a region of phage DNA con- 
taining the EcoRl insertion site from Ch. 16:9-3 was sub- 
cloned into plasmid pUC13. The DNA sequence of the enve- 
lope gene fragment insert was determined by the chemical 
sequencing procedures of Maxam and Gilbert: sequence was 
read from the Nde I site 50 bp from the EcoRl cloning site. 
For the analysis of more complex libraries, an otigonucleo- 

QA 9-3 Ch.16Ch.30 GA »-3 Ch.16 CK30 



* 



Fig. 2. Immunological blotting analysis of protein from immuno- 
reactive phage. Protein was isolated from £. coti MC1000 cells con- 
taining OU6:GA IGA). Ch.l6:9-3 <<Mi. Ch.16. or Charon 30 and 
analyzed using, as probe, either rarbit anti-P-galactos^ase IgG 
[Left) or mAb cl.25 iRiteht). Proteolytic degradation products of the 
intact Ch.i6:GA fusion protein (•) axe Msible. Charon 30 docs not 
contain the 0-galactosidase gene and is included to demonstrate 
antibodv specificity. 0-gal. Native ^pdactostdase. 
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.teg gcggaa ttc cag ctg.... 
. ser ala gtu ph« gin leu .... 



£eo Rl 

EcoRI 

X Charon 16- ^ ^ °" ^ ATQ QQA CCA AAT CTA GT C CTQCCT OAT CAA AAA CCC CCA TCg ga. ttc cag C tg 

9-3 >tr *" fllu ph * ,#u M * T QLY PRO ASN LEU VAL LEU PRO ASP QLN LYS PRO PRO SER glu ph« g,„ t6(J 

GA gp70 V " 8 " THW ^ ° LV PR ° LEU VAL LEU PB ° ASP ° LN LYS P "° PRO S " AR Q QLN SER QLN „ 

Fig. 3. DNA sequence analysis of immunoreactive phage Ch. 16:9-3. OU6 DNA is diagrammed to indicate the position of the * «i a 
s.dase gene tlocz) and the EcoRI cloning site. Left and right "arms' of (he phage DNA are indicated The nucleic ^x^^cduc^t^^ 
sequences surrounding the £r«RI site are indicated below. The nucleic acid and deduced am no ac>S sequencl pretn ,n Ch T^V''" 

f r 1 r f v ?i° PC gene : cncoded ammo ac,ds " (Sequences derived from the synthetic EcoRI linkers Hank the insert > ThV?omMrab7e rL 
of GA-FeLV gp70 prote.n ,s mdicated below. The Pst I site referred to in the text .other Pst I sites exist I thin the 28.^^ t ?h°- n 
thymidine represents a mutation from the parental GA-FeLV sequence. »hich contains a cvtosine at this pos "o U ^ Inth.s ml, ' 
apparently arose dunng the molecular cloning of the random fragment library* mutation 



tide primer complementary to sequences adjacent to the 
EcoRI cloning site could be used to facilitate sequence 
determination from a large number of distinct pUC13 [or 
M13mpll (45)] subclones using primer-dependent dideoxy- 
nucleotide sequencing methods (46, 47). 

DNA sequence analysis of pUC13:9-3 demonstrated the 
presence of a 51-bp insert containing 43 bp derived from the 
envelope gene of Ga-FeLV (Fig. 3). This isolated envelope 
gene fragment represents nucleotides 635-678 of sequences 
encoding gp70 and is present so as to give rise to an ir-frame 
fusion protein with flanking 0-gaiactosidase sequences. The 
amino acid sequence of the encoded gp70 fragment (residues 
213-226 of gp70) is also shown in Fig. 3. Thus, the antigenic 
determinant recognized by virus-neutralizing mAb cl.25 is 
contained within this 14-amino acid region. 

To further demonstrate the presence of the mAb cl.25 de- 
terminant within this fragment, the 51-bp EcoRI fragment of 
pUC13:9-3 was recloned into the EcoRI site of an expression 
system plasmid containing the trpLE' gene (48). The £a>RI 
sites of 0-galactosidase and trpLE' exist in the identical read- 
ing frame (Glu-Phe), ensuring in-frame translation of the 
properly oriented 51-bp fragment with a trpLE' fusion pro- 
tein. Transformants were screened using mAb cl.25 in an 
EL1SA. and results were confirmed by immunological blot- 
ting analysis (data not shown): trpLE' fusion proteins con- 
taining this region of FeLV gp70 are also recognized by the 
mAb. The most direct indication that the antigenic determi- 
nant recognized by mAb cl.25 resides strictly within this 14- 
amino acid fragment was obtained through the use of a syn- 
thetic peptide containing this sequence. The peptide. Met- 
Gly-Pro-Asn-Leu-Val-Leu-Pro-Asp-Gln-Lys-Pro-Pro-Ser. 
was synthesized and shown to block the binding of mAb 
cl.25 to antigen in a gp70 ELISA and, in addition, to prevent 
mAb cl.25-mediated virus neutralization (data not shown). 

DISCUSSION 

The procedure described here appears to be generally appli- 
cable for the localization of antigenic determinants recog- 
nized by mAbs. The ability to isolate mAbs against specific 
proteins has been invaluable in defining and dissecting anti- 
genic and functional epitopes of proteins. Current methods 



to localize antigenic determinants recognized by mAbs to 
specific protein sequences, however, are complicated and 
not of general utility, van der Werf and co-workers (49, so> 
have recently described a procedure in which molecularly 
cloned sequences encoding a specific protein are expressed 
as a fusion protein in bacteria; internal deletions are then 
generated in vitro and correlated with loss of specific immu- 
noreactivity. The method described here uses phage screen- 
ing procedures to directly identify specific sequences encod- 
ing antigenic determinants of specific mAbs. Furthermore, 
this method requires no knowledge of the molecular target of 
the antibody; thus interesting proteins and antigenic determi- 
nants can be isolated from phage libraries derived from geno- 
mic DNA of more complex organisms from which important 
antigens have not yet been defined. 

This procedure has been used to determine the region of 
FeLV gp70 recognized by the FeLV-neutralizing mAb cl.25. 
This region, comprising amino acids 213-226 of gp70, is 
identically conserved between the closely related Snyder- 
Theilen-FeLV-B and GA-FeLV-B viruses (44) and is quite 
closely conserved between feline and murine leukemia virus- 
es (MuLVs) [e.g.. 659f (9/14) homology to Molonev-MCF- 
MuLV g P 70 (51)|. Isolated differences w ithin this reeion ex- 
ist among MuLVs (51-55). Virus host-range recognition 
functions are believed to reside within the ammo-terminal 
half of gp70. and it is interesting to speculate that this reeion. 
identified by reaction with a neutralizing mAb. mav be di- 
rectly involved in this process. Hydropathic analysis, using 
the algorithm of Hopp and Woods (56). shows that this re- 
gion lies within a local peak of hydrophilicity in gp70 (44) and 
thus, presumably, resides on the outside surface of the pro- 
tein. It is interesting to note that this relatively well-con- 
served region overlaps, in part, the amino terminus of a high- 
ly variable type-specific region within MuLV g P 70 proteins 
(52, 53, 55). It has been proposed that this latter region im- 
parts type specificity on the MuLV gp70. These two regions 
may represent one functional protein domain involved in 
host recognition; alternatively, mAb cl.25 may act stencally 
to block access to an adjacent type-specific region. 

The ability to define, on a viral protein, a site involved in 
antibody-mediated virus neutralization may serve to direct 
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the choice of synthetic peptides to be tested as vaccine ami- 
gens. Synthetic peptides to such a region of foot and mouth 
disease virus VPi have been tested in the development of a 
synthetic peptide vaccine against the disease (9). Peptides 
derived from the region of FeLV gp70 identified here may 
provide useful vaccine antigens for the prevention of FeLV- 
associated disease in cats. 

Note Added in Proof. Three independent FeLV-neutralizmg mAbs 
derived by Osterhaus and Weijer were also shown to recognize the 
synthetic peptide. These mAbs had previously been sho*n to com- 
pete with mAb cl.25 for gp?0 binding (A. Osterhaus ( National Insti- 
tute of Public Health. The Netherlands) and K. Weijer (The Nether- 
lands Cancer Institute/, personal communication). 

We are indebted to Niels Pedersen <l'ni\ersit\ of California. Da- 
vis, Veterinars Medicine) for providing us w uh monoclonal antibod- 
ies, viruses, and other reagents and to Frank McCormick for discus- 
sions from which this approach to mapping antigenic determinants 
"as derived. W e thank our colleagues at Cetus u ho v.^rc in\oKed 
m aspects of this work: Glenn Horn for molecular clomne into Char- 
on 16. Danute Nitecki for oligopeptide synthesis. Mark Williams for 
DNA sequence analysis. David Getfand for critical review of the 
manuscript, and Tom White for encouragement and support. Molec- 
ularly cloned GA-FeLV and virus were provided bv Norman David- 
son (California Institute of Technology ). This work was partlv sup- 
ported by funds from the TechAmerica Group. Inc. 
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324 THE IDENTIFICATION OF RECOMBINANT CLONES 



?ROBES ,ZAT,ON ° F ° R ^ ,MM0B,L,ZED 0N ALTERS TO RADIOACTIVE 

There are many methods available to hybridize radioactive probes in solu 
tion to DNA or RNA immobilized on nitrocellulose filters. ffnS; 
differ in the following aspects: metnods 

th formrmid a e n ) d tGmperatUre used (68 ° C in a( ^ ueous solution or 42°C in 50% 
the volume of solvent and the length of hybridization (large volumes for 
penods as long as 3 days or minimal volumes for time? as shoTas 4 

the degree and method of agitation (continuous shaking or stationary)- 
the concentration of the labeled probe and its specific activity ™' 

L2L? ™ mip0 ? nds ' such ? dext ™ sulfate, that increase the rate of 
reassociation of nucleic acids; 

the stringency of washing following the hybridization. 

Although the choice depends to a large extent on personal preference we 
would like to offer the following guidelines. preierence, we 

1. Hybridization reactions in 50% formamide at 42°C are easier to set ud 
present less of an evaporation problem, and are less harsh on the filters 
than is hybridization at 68°C in an aqueous solution. The kinetics of the 
hybridization reaction in 80% formamide are approximately three to four 
times slower than in an aqueous solution (Casey and Davidson 1977) 
Assuming a linear relationship between the rate of hybridization and 
formamide concentration, the rate in 50% formamide should be two times 
slower than m an aqueous solution. 

2. The smaller the volume of hybridization solvent, the better. The kinetics 
of nucleic acid reassociation are faster, and the amount of probe needed 
may be reduced so that the DNA on the filter acts as the driver for the 
reaction All these are important parameters when detecting clones of 
bw-abundance mRNAs. However, it is essential that sufficient liqufd be 

BJ^JISZ. to remain at aU times covered by a film 0f the 

3. Continual movement of the probe solution across the filter is unnecessary 
even for a reaction driven by DNA immobilized on the filter. However, if 
a large number of filters are hybridized simultaneously, agitation is 
advisable m order to prevent the filters from adhering to each other 

4. The kinetics of the hybridization reaction are difficult to predict from 
theoretical considerations, partly because the exact concentration of the 
immobilized nucleic acid and its availability for hybridization are un- 
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Eys* simp,y enter the ■^^esttsek 

1 Y Z 

X * ~5 * To * 2 = numb er of hours to achieve Qt t 2 
where, 

X = the weight of probe added (in M g) 

Y ~-'Z^?XT k $ obes ' compiexity is to the 

Z-the volume of the reaction (in ml) 

nA f I e I ^5?*? ti0T l J™ 3x Coth2 has been «Mhed, the amount of the 
probe available for additional hybridization to the filter is negligible For 

, nS 6 ;. ? n t ed / DNA Pr ° bes ' the ^"dization time may be shortened 

Starrs 

5. In the Presence of dextran sulfate, the rate of association of nucleic acids 
is accelerated because the nucleic acids are excluded from the vo S me of 
he solution occupied by the polymer. Their effective clcenlSn is 

therefore increased. The rate of association reportedly increases 10 foW in 
the presence of 10% dextran sulfate (Wahl et al 1979? 

Although dextran sulfate is useful in circumstances where the rate of 
hybridization is the limiting factor in detecting sequence?* ^interesUt is 
unnecessary for most purposes. It is also difficult to handle because of iS 
viscosity and sometimes can lead to high backgrounds 

6. ^ general the washing conditions should be as stringent as possible- i e 
a combination of temperature and salt concentration should be chosen 
hat ,s slightly (5°C) below the T m of the hybrid under study. The tempera" 

ture and salt conditions can often be determined empirically in P S- 
nary experiments where Southern blots (see pages 382ff) of genome 

£ni 6 %5 1Md * the Pr0be 0f interest then washed Snder 
conditions of different stringency. 
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TABLE 111 HYBRIDIZA TION CONDITIONS FOR SOUTHERN FILTERS 

sp. act. 

DN A on of probe DNA Amount of Time of 

filter (cpm//ig) probe added hybridization (hr) 



Fragments of 


10 7 


10 5 -10 6 cpm 


3-4 


cloned DNA 




(0.01-0.1 M g) 




(~ 100 ng/fragment) 








Total eukaryotic 


10 8 


1 x 10 7 cpm 


12-16 


DNA (10 




-5* 10 7 
(0.1-0.5 M g) 





8. Incubate the bag submerged in a water bath at 68°C for the required 
hybridization period. 

9. Remove the bag from the water bath and quickly cut along the length of 
three sides. Using gloves, remove the filter and immediately submerge it 
in a tray containing a solution of 2x SSC and 0.5% SDS at room 
temperature. 



Note. Do not allow the filter to dry out at any stage during the washing 
procedure. 

10. After 5 minutes, transfer the filter to a fresh tray containing a solution of 
2* SSC and 0.1% SDS and incubate for 15 minutes at room temperature 
with occasional gentle agitation. 

11. Transfer the filter to a flat-bottomed plastic box containing a solution of 
0.1* SSC and 0.5% SDS. Incubate at 68°C for 2 hours with gentle agita- 
tion. Change the buffer and continue incubating for a further 30 
minutes. 

Note. If the homology between the probe and the DNA bound to the filter 
is inexact, the washing should be carried out under less stringent condi- 
tions. In general, washing should be carried out at T m ~ 12°C. 
The following relationships are useful: 

a. r m = 69.3+ 0.41 (G + C)%-650/L. 

L equals the average length of the probe in nucleotides (Marmur 
and Doty 1962; Wetmer and Davidson 1968). 



b. The T m of a duplex DNA decreases by 1°C with every increase of 1% 
in the number of mismatched base pairs (Bonner et al. 1973). 
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c (rj M2 - (7^1 = 18.5 logiu-£f- 

where mi and are the ionic strengths of two solutions (Dove 
and Davidson 1962). 

12. Dry the filter at room temperature on a sheet of Whatman 3MM paper. 

13. Wrap the filter in Saran Wrap and apply to X-ray film to obtain an 
autoradiographic image (see page 470). 

Notes 

Hybridization may also be carried out in: 

a. flat-bottomed plastic boxes. 

b. buffers containing formamide. Each increase of 1% in the formamide 

T52o ra £ IOn l0We ^ he T ™ of a °NA duplex by 0.7°C (McConaughy et 
al. 1969; Casey and Davidson 1977) 
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using the dot blot assay described here. This assay has been successfully 
used in several laboratories. 1617 
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[19] Isolation of Multigene Families and Determination of 
Homologies by Filter Hybridization Methods 

By Gerald A. Beltz, Kenneth A. Jacobs, Thomas H. Eickbush, 
Peter T. Cherbas, and Fotis C. Kafatos 

A high proportion of genes in eukaryotes are now known to be members 
of multigene families. Although initial indications were provided by pro- 
tein sequencing, the ubiquity of multigene families has been revealed 
most convincingly by recombinant DNA methods. A list of some of the 
best known multigene families, by no means inclusive, would include the 
genes for histones, globins, immunoglobulins, histocompatibility anti- 
gens, actins, tubulins, seed storage proteins, chorion proteins, keratins, 
collagens, cuticle proteins, yolk proteins, heat shock proteins, and sali- 
vary glue proteins. Even genes for proteins that appear homogeneous 
may belong to multigene families; e.g., the chicken ovalbumin gene is 
now known to be expressed coordinately with two homologous genes, X 
and Y. In the field of hormone research, analysis of gene families related 
to well known effectors such as insulin or growth hormone is one of the 
most exciting areas of current investigation. Many members of multigene 
families show stage- or tissue-specific expression: a familiar example is 
the family of globin genes. Thus, the study of multigene families is impor- 
tant for understanding the mechanisms of physiological regulation, differ- 
ential gene expression, and molecular evolution in eukaryotes. 

In the study of multigene families by recombinant DNA methods, 
usually the first step is to isolate clones (cDNA or genomic) that represent 
all members of the family. The next step is to discriminate among clones 
that correspond to different genes. The third step is to relate each gene to 



METHODS in enzymology, VOL. 100 



Copyright © 1983 by Academic Press. Inc. 
All rights of reproduction in any form reserved. 

ISBN 0-12-182000-9 



ISOLATION OF MULTIGENE FAMILIES 



a particular type of protein product or to a specific RNA transcript. Ulti- 
mately, structural characterization of the genes, including sequence anal- 
ysis, is undertaken. Because of their sequence homologies, members of a 
multigene family often cross-hybridize. In the first three steps of such 
studies, careful control of the hybridization conditions is necessary— both 
to isolate the family members as cross-hybridizing species in the first step, 
and to permit their discrimination and characterization in the second and 
third steps. A number of filter hybridization methods are used in these 
studies: phage plaque 1 and bacterial colony hybridization, 2 dot blot hy- 
bridization, 3 Southern hybridization, 4 Northern hybridization, 5 and hy- 
brid-selected translation. 6 Their specific features are discussed elsewhere 
in this volume. Here, we wish to emphasize their relatedness and to point 
out how deliberate control of the hybridization stringency maximizes 
their utility. We shall give examples from our studies of the chorion gene 
families in silkmoths. 

The silkmoth chorion system has been reviewed. 7 More than a hun- 
dred genes belonging to several multigene families encode the structural 
proteins of the chorion. These families may themselves be related, consti- 
tuting a superfamily, 8 but the interfamily homologies are low and can be 
neglected in this discussion. Within each family, both closely and dis- 
tantly related genes are found: observed degrees of mismatching range 
from less than 1% to as much as 50%. 9 Based on this variation, members 
of each family have been classified into distinct types (distantly related 
genes, observed mismatching 10 to 50%) and into copies of the same gene 
type (more closely related genes, <1 to 5% mismatching). In our experi- 
ence, even gene copies differ by one or more substitutions that lead to 
amino acid replacements, and thus have already taken the first step to- 
ward evolving into distinct genes, by the criterion of differences in the 
encoded proteins. The sequence variations are not uniformly distributed 
within the chorion genes: in each family the sequence corresponding to a 
central protein domain is substantially conserved, whereas flanking se- 



1 W. D. Benton and R. W. Davis, Science 196, 180 (1977). 

2 M. Grunstein and D. S. Hogness, Proc. Natl. Acad. Sci. U.S.A. 72, 3961 (1975). 

3 F. C. Kafatos, C. W. Jones, and A. Efstratiadis Nucleic Acids Res. 7, 1541 (1979). 

4 E. M. Southern, /. Mol. Biol. 98, 503 (1975). 

5 J. C. Alwine, D. J. Kemp, and G. R. Stark, Proc. Natl. Acad. Sci. U.S.A. 74, 5350 (1977). 

6 R. P. Ricciardi, J. S. Miller, and B. E. Roberts, Proc. Natl. Acad. Sci. U.S.A. 76, 4927 
(1979). 

7 F. C. Kafatos, Am. ZooL 21, 707 (1981). 

8 J. C. Regier, F. C. Kafatos, and S. J. Hamodrakas, Proc. Natl. Acad. Sci. U.S.A. (in 
press). 

9 C. W. Jones and F. C. Kafatos, J. Mol. Evol. 19, 87 (1982). 
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quences that encode protein "arms" (NH 2 -terminal and COOH-terminal 
domains) are much more variable. 10 



Parameters Affecting Nucleic Acid Hybridizations 
A Qualitative Summary 

By definition, the course of any hybridization reaction is determined 
by the concentrations of the reacting species and the second-order rate 
constant k. The stability of the resulting duplex is measured by the melt- 
ing temperature, T m . For reassociation reactions in solution involving 
perfectly matched complementary strands, the effects of various reaction 
conditions have been investigated in detail, and thus T m and k values can 
be calculated with some precision. Neither calculation can be extended 
with confidence to cases involving imperfect hybrids or to solid-phase 
reactions. Nevertheless, the effects of reaction conditions can be ex- 
pected to be qualitatively similar, and thus they bear review here. 

In general, T m depends on the composition of the duplex (7 m increases 
with G + C content), on the duplex length (T m increases with length), on 
the ionic strength (T m increases with salt concentration up to a plateau), 
and on the concentration of any organic denaturants present (e.g for- 
mamide, urea). T m is higher for RNA: DNA hybrids than for the corre- 
sponding DNA : DNA duplexes, and the differential effect of increasing 
(G + C) is greater for the hybrids. 

Parameters such as (G + C), duplex length, ionic strength, and con- 
centration of any denaturants generally affect k in the same direction as 
they affect T m . Therefore, an additional parameter, which is of paramount 
importance for k, is the temperature of the reaction, T { . In general, k 
increases with T { , reaches a broad maximum at 25 to 20° below T m , and 
decreases thereafter, becoming severely depressed at T x > T m - 5°. An 
example of such temperature dependence is shown in Fig. 1, for the 
reassociation of T4 DNA in solution. 1 1 

The qualitatively similar effects of most reaction conditions on k and 
T m have led to the practice of summarizing the reaction conditions in 
terms of the criterion, which is numerically equal to r m minus T { . When 
the criterion is large, the reaction is described as permissive; when it is 
small, the reaction is stringent. While the absolute values of k vary drasti- 
cally as a function of individual parameters (e.g., salt), the relative values 
as a function of criterion are thought to be described in general by the 

0 fl982) Ham ° drakaS ' C W ' J ° neS ' F - C Kafat ° S * Bi ° Chim - Bhphys - Acta m > 42 
" T. I. Bonner, D. J. Brenner, B. R. Neufeld, and R. J. Britten, /. Mol. Biol. 81, 123 (1973). 
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Temperature below T m (°C) 

Fig. 1. Rate of reassociation as a function of temperature for normal (self-hybridization) 
and mismatched DNA (cross-hybridization). Data are replotted from Fig. 3 of Bonner et 
al. u ; they are derived from normal T4 DNA and T4 DNA partially deaminated with nitrous 
acid. The large open and filled circles are from the best-fit curve plotted in the original figure, 
and dotted lines are extrapolations assuming that at T m the rate of reassociation is zero. 
Under the conditions of the experiment (0.12 M phosphate buffer), the T m of normal T4 
DNA is 81°. The discrimination ratio, i.e., k of cross-hybridization divided by k of self- 
hybridization, is also presented. 



curve of Fig. 1, for the reassociation of perfectly matched strands in 
solution. 11 

The problem of discrimination among related nucleic acid sequences is 
one of devising hybridization conditions that are stringent for some se- 
quences and permissive for others. Although it is known that mismatching 
depresses both T m and k, its effect on the relationship between k and 
criterion has not been studied in detail. Figure 1 presents one of the few 
sets of systematic data available for hybridization between normal T4 
DNA and T4 DNA partially deaminated with nitrous acid (resulting in 
purine-pyrimidine mismatches that lower the T m value by 18°). The tem- 
perature dependence of k for this ' 'cross-hybridization' * (T4 DNA vs 
deaminated T4 DNA) shows some similarity to that for "self-hybridiza- 
tion" (T4 DNA vs T4 DNA), but the rate constant is lower and reaches its 
optimal value at a lower temperature. 
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From data such as those of Fig. 1 , one may calculate what we will call 
the "distribution ratio" for each temperature, i.e., the ratio between the 
rate constants for cross-hybridization and for self-hybridization reactions. 
The higher the value of this ratio, the easier it is to detect even distant 
homologs; the lower the ratio, the easier it is to discriminate between self- 
hybrids and cross-hybrids. It can be seen that the ratio varies with crite- 
rion in a sigmoidal manner. Although the exact shape of the sigmoid is not 
based on extensive empirical data, its general shape is not affected by 
minor variations in the individual k vs temperature curves: it depends on 
the bell shape of the curves and the shift of the cross-hybridization curve 
to lower temperatures. 

The sigmoidal curve for discrimination ratio vs temperature has im- 
portant implications for the study of multigene families. If we imagine a 
set of such curves, corresponding to different degrees of mismatching, we 
can understand why it is not possible to find a compromise criterion that 
would permit both detection of distant homologs and discrimination be- 
tween close homologs. If both goals are to be attained, the following two- 
step procedure is necessary. 

Step 1 . To recover all members of a multigene family, an initial screen 
should be performed at a very permissive criterion. From Fig. 1, it can be 
seen that a criterion 40° to 50° below T m is desirable if very distant homo- 
logs are to be recovered: at these temperatures the rate of self-hybridiza- 
tion is reduced two- to six-fold, but even distant homologs give a rela- 
tively strong signal. A practical limit is imposed by background problems. 
For example, if the probes have been cloned by G : C tailing, nonspecific 
hybridization due to the tails becomes significant at very permissive 
criteria. 

Step 2. To discriminate between homologs, a very stringent criterion 
should be used in a second hybridization step. Since it appears that the 
discrimination ratio drops steeply near the T m of the mismatched hybrids 
(Fig. 1), we recommend the use of criteria near the T m of the relevant 
cross-hybrids. Temperatures as high as 5° below the T m of the self-hybrids 
can be used without difficulty. 

In theory, these two steps might be compressed into a single two-step 
experiment: After an initial hybridization under permissive conditions, 
poorly matched hybrids might be recognized by their release in a subse- 
quent wash under stringent conditions. In practice, although this protocol 
is useful for discriminating between distant homologs, 3 we have found it 
not to be useful when the homologs are closely related. In part, the 
problem is technical. Although probes are for the most part reversibly 
hybridized, in filter hybridization a variable portion of the probe may 
become irreversibly bound. This problem can be minimized by scrupu- 
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lously avoiding drying the filter until after the melting reaction, but the 
precaution is not always sufficient. 

A more important consideration affecting the design of the experiment 
is that a stringent hybridization and a stringent wash are not entirely 
equivalent. A priori the stability of a hybrid during washing depends on 
the T m of the best-matched region of duplex. By contrast k reflects the 
nucleation frequency, which is an unknown function of the distribution of 
matched and unmatched stretches. In practice we find that a stringent 
hybridization is more discriminatory than an equally stringent wash. 

Quantitative Considerations 

It is clear from the preceding discussion that in the study of multigene 
families, selection of appropriate conditions requires attention to the pa- 
rameters that govern T m and k. We emphasize that systematic empirical 
data for matrix-supported reactions, and for mismatched sequences, are 
extremely limited. Thus, the appropriate conditions must be determined 
empirically in each case. However a starting point can be selected by 
calculations, and we summarize here the pertinent information from the 
literature for convenience in doing so. 

The rate constant for renaturation of randomly sheared DNA in solu- 
tion is given by the Wetmur-Davidson equation 12 

* - (kxL 0 5 )/N 

where L is the mean length of the reassociated duplex per nucleation; N is 
the complexity of the DNA; and k N is a length-independent nucleation 
rate constant. The length factor is quite uncertain, because it attempts to 
incorporate effects of length on both nucleation frequency and yield of 
duplex per nucleation. Therefore, the equation can be used only for a 
rough estimate if the hybridizing strands differ in length. The constant & N 
is influenced by such environmental factors as the incubation tempera- 
ture, the salt concentration, organic solvents, and various polymers. 13 In 
an aqueous solution at an incubation temperature of T m - 25° and 1 M 
Na + , /c N = 3 x 10 5 liters/mole-second. Attachment of one of the reacting 
species to a matrix decreases £ N , probably by an order of magnitude. 133 In 
addition, if a large amount of DNA is filter-bound, the hybridization may 
be limited by diffusion of the probe to the filter, unless the reaction vessel 
is shaken. 13a 

The effect of individual parameters on k is as follows. The change in 

12 J. G. Wetmur and N. Davidson, J. Mol. Biol. 31, 349 (1968). 

13 R. Wieder and J. G. Wetmur, Biopolymers 20, 1537 (1981). 

I3a R. A. FlavelU E. J. Birfelder, J. P. Sanders, and P. Borst, Eur. J. Biochem. 47, 535 (1974). 
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optimal k at T m - 25° as a function of salt has been tabulated, 1214 and is 
most dramatic below 0.4 M Na + . An 1% increase in G + C increases 12 
optimal k by a factor of approximately 0.018. Mismatching is reported 1114 
to depress the optimal k by a factor of 2 per 10° reduction in T m . Concen- 
trations of 80% formamide are thought to depress k by a factor of 3 for 
DNA-DNA hybrids and by a factor of twelve for RNA-DNA hybrids, 
although the data are limited. 15 Volume-excluding inert polymers can be 
used to increase /c. 1315a In an aqueous solution of 1 M NaCl at 70°, the 
renaturation rate increases 100-fold when dextran sulfate is added to 
40%. 13 

Concerning T m , a large number of quantitative studies can be summa- 
rized as follows 11 * 1516 : For a perfectly matched DNA duplex, 

T m = 81.5 + 0.41 (G + C) + 16.6 log(Na+) 

- 0.63(% formamide) - 300 + 

a 

where G + C = percentage of guanine + cytosine; Na + = molarity of 
(Na + ) or equivalent monovalent cation; and d = the length of the hybrid- 
ized duplex in nucleotides. 

It is prudent to bear in mind the limits of each term in this formula. The 
dependence on G + C is accurate over the range 30-75% G + C. 17 The 
salt dependence 16 is valid for 0.01 to 0.40 M (Na + ), only approximately so 
at higher (Na + ); T m is maximal at about 1.0-2.0 M (Na + ). The depression 
of T m by formamide 15 is greater for poly(dA:dT) (0.7571% formamide) 
than for poly(dG:dC) (0.5071% formamide), and the value used in the 
formula was derived from human rDNA (58-67% G + C). The effect of 
polynucleotide length on T m is somewhat controversial, but the correction 
listed 14 seems valid over the range 0.05-0.5 M salt. Finally, the formula 
applies to the "reversible" T m , e.g., as assayed by optical measurements. 
The " irreversible" T m9 which is of relevance for autoradiographic detec- 
tion, is usually higher by 7-10° in aqueous solutions. 18 

For estimating the T m of RNA-DNA hybrids, the paper of Casey and 
Davidson 15 should be consulted. The effect of formamide on such hybrids 
is nonlinear (unlike the effect on DNA-DNA hybrids), leading to the well- 
known preferential formation of RNA-DNA hybrids at high formamide 
concentrations. At 80% formamide, 0.3 M Na + , RNA-DNA hybrids are 

14 R. J. Britten, D. E. Graham, and B. R. Neufeld, this series, Vol. 29, p. 363. 

15 J. Casey and N. Davidson, Nucleic Acids Res. 4, 1539 (1977). 

I5a G. M. Wall, M. Stern, and G. R. Stark, Proc. Natl. Acad. Sci. U.S.A. 76, 3683 (1979). 

16 C. Schildkraut and S. Lifson, Biopolymers 3, 195 (1965). 

17 J. Marmur and P. Doty, J. Mol. Biol. 5, 109 (1962). 

18 K. Hamaguchi and E. P. Geiduschek, J. Am. Chem. Soc. 84, 1329 (1962). 
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reported to be 20-30° more stable than DNA-DNA hybrids. At 50% for- 
mamide, 0.3 M Na + , we find that the difference is approximately 1 1° in the 
case of /3-globin sequences. 3 

The effect of mismatching on T m is obviously very important for stud- 
ies on multigene families. The usual simplification is that T m decreases by 
1° for every 1 ± 0.3% mismatch, 14 reckoned for DNA-DNA duplexes of 
40% G + C with randomly placed mismatches. 14 In another study, per- 
centage homologies for BK and SV40 DNA fragments determined by 
heteroduplex analysis gave the best fit to actual sequence data when 0.57 
1% mismatch was used. 18a Obviously the distribution of mismatched 
bases will have a critical determining effect. Consider an extreme exam- 
ple: sequence A differs from sequence B by 50%. If the mismatch is 
clustered so that extended regions of A and B are identical is sequence, a 
high T m will be observed; by contrast, if the mismatch is dispersed so that 
every second nucleotide differs, no hybridization will occur. The effect of 
sequence on the hybridization of short oligonucleotides with mismatches 
has been studied. 19 ' 20 The extrapolation of these very interesting results to 
real-life problems will require some judgment and additional experimenta- 
tion. Conditions that permit detection of homologies between various 
unequally diverged regions of sequenced viral genomes have been investi- 
gated. 21 In the chorion system, we have noted that sequences that are 
highly divergent overall often show a smaller T m depression than ex- 
pected. 22 As an example, the B family clones pc401 and pc408 differ by 
37.2% overall, but only by 13.6% in the conserved central domain 9 ; the 
change in T m of the cross hybrids, as determined from dot blots, 22 is 
approximately 10°. 

It must be stressed again that the formulas given above were derived 
for solution hybridization, and thus are only rough approximations for 
filter hybridization. For example, it would not be surprising if the effec- 
tive length of filter-bound hybrids is reduced by steric constraints. In 
agreement with this possibility, we have the impression that for filter self- 
hybridizations k declines more rapidly than expected from Fig. 1, when 
the temperature is raised to within 15° to 5° of the experimentally deter- 
mined T m . The T m itself may be slightly depressed for filter-bound hy- 

,8a R. C. Yang, A. Young, and R. Wu, J. Virol. 34, 416 (1980). 

19 R. B. Wallace, M. Schold, M. J. Johnson, P. Bembek, and K. Itakura, Nucleic Acids Res. 
9, 3647 (1981). 

20 S. Gillam, K. Waterman, and M. Smith, Nucleic Acids Res. 2, 625 (1975). 

21 P. M. Howley, M. A. Israel, M.-F. Law, and M. A. Martin, J. Biol. Chem. 254, 4876 
(1979). 

22 G. K. Sim, F. C. Kafatos, C. W. Jones, M. D. Koehler, A. Efstratiadis, and T. Maniatis, 
Cell 18, 1303 (1979). 
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brids. For example, for £-globin DNA-DNA dot hybrids of 51% G + C, 
580 bp at 0.3 M Na" and 50% formamide, we have observed 3 a T m of 56°, 
whereas the formula presented above would predict 60.6° for reversible 
r m , and therefore at least 67° for irreversible T m . 

Two commonly overlooked variables in differential hybridization, 
which affect discrimination among nucleic acid sequences, are the extent 
of the reaction and the mass ratio of the radioactive probe to the unla- 
beled, filter-bound nucleic acid. In general, the hybridization of the probe 
will show the following kinetics when two (or more) filter-bound se- 
quences react in the same bag with a probe in solution 



where C is the concentration of single-stranded probe at time /; k is the 
reassociation rate constant for the probe sequence in solution; k { is the 
reassociation rate constant between the probe and the identical sequence, 
/, bound to the filter; and k\ is the reassociation rate constant between the 
probe and the cross-hybridizing sequence H\, bound to the filter. 

The kinetics will differ significantly, depending on whether the filter- 
bound nucleic acid or the probe is in excess, and in the latter case whether 
or not the probe can itself reassociate. For simplicity, let us assume that 
all filter-bound sequences are equal in mass (/ = //j), and that k f = k. (In 
fact kf < k 9 but the exact magnitude of the difference is not known, and it 
does not change the results qualitatively.) Then, if the filter-bound nucleic 
acid is in vast excess, for example, in typical dot blots, 3 the hybridization 
reactions follow pseudo-first-order kinetics; when they go to completion, 
the fraction of the probe hybridized to sequence i becomes 



where m is the number of filter-bound sequences. 

At any time during the reaction, the ratio of the amount of probe 
hybridized to sequence i relative to sequence j is given by k\/kj (Fig. 2); 
i.e., the ratio is invariant with respect to time. In this case, discrimination 
between self- and cross-hybrids is not affected by the extent of the reac- 
tion (Fig. 2), because all the filter-bound sequences continuously compete 
for the same, limiting probe. (Discrimination will be affected if different 
samples are hybridized separately, but that situation does not normally 
apply.) 

If the radioactive probe is in excess over the filter-bound sequences 
(e.g., typical Northern blots, 5 genomic Southern blots, 4 and initial screens 
of genomic or cDNA libraries), discrimination will generally decline as 



dddt = -kC 1 - k { IC - k x H x C 



(1) 




(2) 
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2 6 10 

C o ! 1/2 

Fig. 2. Effective discrimination between self-hybrids and cross-hybrids, as a function of 
the extent of the reaction. The solid line shows the ratio of the amounts of probe hybridized 
to filter-bound heterologous and homologous sequences, E x {t)iE{t), for each of the three 
cases discussed in the text: filter-bound sequence in excess, double-stranded (ds), denatured 
probe in excess, and single-stranded (ss) (e.g., M13) probe in excess. The discrimination 
ratio, hjk, is assumed to be the same in all cases, 0.1. The dashed line shows normal C 0 t 
kinetics of denatured, double-stranded DNA reannealing in solution. 

the reaction proceeds. If the probe in solution can self-anneal, the rele- 
vant equation is 

E x (t)IE{t) = [1 - (1 + kC 0 t)-"] [1 - (1 + kC 0 tr l ]' x ( 3 ) 

where £(/), E\(t) = extent of the reaction, i.e. the fraction of filter-bound 
DNA that has hybridized to the probe, for the self-hybrid and cross- 
hybrid i, respectively; C 0 = concentration of single-stranded probe at time 
0; n = k-Jk, i.e., the discrimination ratio. 

It can be seen that the effective discrimination, i.e., extent of cross- 
hybridization over self-hybridization, equals the discrimination ratio kjk 
very early in the reaction and deteriorates with the kinetics described 
above and as shown in Fig. 2. 



276 



DNA ISOLATION, HYBRIDIZATION, AND CLONING 



[19] 



If the probe in solution cannot self-anneal (e.g., M13 probes), the 
effective discrimination is given by 

E (t) 

_Al = [i _ e -*iC 0 /] [! _ e -*Co/]-i (4) 

The parameters are defined above. Effective discrimination again equals 
kjk very early in the reaction, but deteriorates very rapidly (Fig. 2). 

In summary, hybridizations should ideally be done for very short 
times: discrimination among various clones will then be maximal, and 
the same under conditions of either probe excess or filter-bound DNA 
excess. In practice, however, short hybridization times may not generate 
a sufficient autoradiographic signal. Thus, to distinguish among related 
sequences, it is best to use conditions of filter-bound DNA excess, as in 
typical dot blots 3 (see below). 



A Strategy for Systematic Analysis of Multigene Families 

Recovery of the Family 

To begin the study of a multigene family for which a probe already 
exists, we recommend that a sublibrary be prepared by screening the 
appropriate library (cDNA or genomic) at a very permissive criterion 
[k x — k, n = 1 in Eq. (3)], using the Grunstein-Hogness 2 23 or Benton- 
Davis 1 procedures. At this stage we do not attempt to achieve discrimina- 
tion, but only to recover as complete a collection of family members as 
possible; as discussed above, these two goals cannot be pursued simulta- 
neously. A criterion of 30-35° below T m and high salt (e.g., 0.6 M NaCl) 
for speeding up the hybridization reaction are recommended. To avoid 
unnecessary increase in the background, which is inherently high in the 
initial screening, the criterion should not be relaxed more than is neces- 
sary for the particular family. For completeness of the sublibrary, even 
very weakly hybridizing clones should be collected — especially since the 
plaque or colony size is frequently not uniform. 

Such an initial screen will usually produce a mixture of true and false 
positives, which are distinguished by rescreening. Typical sublibraries 
that we have recovered in our study of chorion genes include from 200 to 
600 clones, too many to make isolation of DNA from each clone practical. 
For sublibraries using bacterial plasmids as vectors, bacterial cultures are 
arranged in Microtiter plates, replica plated onto a nitrocellulose filter, 
placed on nutrient agar, and rescreened by the method of Grunstein and 
Hogness. 2 We have developed a similar replica plating method to rescreen 

23 M. Grunstein and J. Wallis, this series, Vol. 68, p. 379. 



[19] 



ISOLATION OF MULTIGENE FAMILIES 



bacteriophage \ genomic sublibraries (see below). An example of the 
power and reproducibility of this method is shown in Fig. 3. Part a of Fig. 
3 shows a rescreening of a genomic sublibrary under the same hybridiza- 
tion conditions as for the initial screen of the total library. This sublibrary 
had been selected to include genes closely homologous to the cDNA 
clone pc401, a B-family chorion sequence from Antheraea polyphe- 
nols? 21 APcllO, a genomic clone containing two copies of chorion gene 
401, and APcl73, another genomic clone containing two copies of gene 
10, a distantly related B-family sequence, were included as positive and 
negative controls. 9 A few blank spots were seen, corresponding to false 
positives of the initial screen, and a few clones grew poorly. Among the 
rest, some intensity differences were evident. However, even the nega- 
tive control (APcl73), showed significant hybridization under these con- 
ditions. 

The next step is plaque or colony hybridization under stringent condi- 
tions. Figure 3b shows typical results, for the same sublibrary as in Fig. 
3a. In this case, stringency was enhanced by a 15° increase in temperature 
and by use of a more specific probe. Instead of the complete pc401 DNA 
(565 bp), the probe contained a 257 bp fragment from the region corre- 
sponding to the 3' end of pc401 mRNA. This probe encodes only a small 
portion of the conserved central protein domain, and consists largely of 
the more variable COOH-terminal arm sequence and nonconserved 3' 
untranslated region. 9 Only 122 bp of this 3'-specific probe is well con- 
served within the B family (9% mismatch relative to pclO), whereas the 
entire pc401 probe includes a total of 255 bp of well conserved sequence. 
Clearly, the experiment of Fig. 3b achieved high discrimination. Hybrid- 
ization with APcl73 and with many of the clones in the sublibrary was 
essentially undetectable; only a few clones hybridized as intensely as 
APcl 10 itself, and others could be assigned to 3 or 4 classes on the basis 
of their degree of hybridization. 

We perform plaque hybridizations to bacteriophage \ genomic subli- 
braries by the following procedure. 

Preparation of Filters. Bacterial lawns are prepared by mixing 0.25 ml 
of 10 mM MgCl 2 -10 mM CaCl 2 , 0.5 ml of fresh bacterial overnight 
culture, and 6.5 ml of medium in 0.8% agarose. The mixture is poured 
over bottom agar (medium with 1.2% agar) in 150-mm petri dishes. Care 
should be taken to avoid air bubbles. We find that the bottom agar should 
be poured at least 2 days in advance to avoid condensation during phage 
growth. Allow the top agar mixture to gel for 10 min at room temperature. 
Using a replica plater, 24 transfer a small aliquot of each phage stock from 

24 M. Brenner, D. Tisdale, and W. F. Loomis, Exp. Cell Res. 90, 249 (1975). 
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Fig. 3. An Antheraea polyphemus genomic DNA library was screened as described by 
Benton and Davis 1 with a combination of two chorion cDNA probes, pcl8 and pc401. (The 
genes encoding chorion protein 18 are members of the A family and are each paired with a B- 
family gene encoding chorion protein 401.) Plaque-purified phage stocks from all positive 
clones were prepared. Aliquots of each stock were delivered into Microtiter wells and plated 
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Microtiter plates to the bacterial lawn. Allow the liquid to be absorbed 
into the agar (about 30 min), invert the plates, and incubate overnight at 
37°. Uniform plaques of about 6 mm should develop. Chill the plates at 4° 
for 10-30 min before placing a sheet of dry nitrocellulose directly on the 
plaque-containing lawn. Precut nitrocellulose circles are commercially 
available but we find it more economical and convenient to use rectangu- 
lar sheets cut to 8.5 x 12.5 cm. Allow the DNA to transfer for 60 to 90 sec, 
remove the filter from the plate, and denature and fix the DNA to the 
nitrocellulose by soaking the filter in 0.1 M NaOH, 1.5 M NaCl for about 
30 sec. Neutralize the filter by soaking in 0.5 M Tris-HCl, pH 7.5, 0.5 M 
NaCl for about 30 sec. Blot the excess buffer on 3 MM paper, air-dry, and 
bake under vacuum for 2 hr at 80°. We find that the same plate can be used 
to prepare at least four filters with no apparent loss in signal. 

Prehybridization. Filters are prehybridized in 2-4x SET buffer (lx 
SET buffer is 0.15 M NaCl, 0.03 M Tris-HCl, pH 8, 1 mM EDTA), 10x 
Denhardt's solution, 0.2% SDS, and 50 /tg/ml herring sperm DNA for at 
least 3 hr at the hybridization temperature. 

Hybridization and Washing. Filters are hybridized in prehybridization 
mixture supplemented with the radioactive probe and dextran sulfate to a 
final concentration of 10%. Although we normally hybridize overnight 
(12-16 hr), we find that the signals are sufficiently strong after hybridizing 
for only 5 hr. Filters are washed for 20 min each in two changes of 2x 
SET, 0.2% SDS and two changes of 1 x SET, 0. 1% SDS at the hybridiza- 
tion temperature. 

Discrimination by Dot Hybridization 

Once experiments such as those of Fig. 3b have focused attention on a 
limited number of clones, more refined analysis can be undertaken by dot- 
blot hybridization. The original paper describing the method 3 should be 
consulted, as well as a more recent review. 25 In this procedure, DNAs are 
purified from the clones of interest and immobilized on filters in precisely 

" F C Kafatos, G. T. Thireos, C. W. Jones, S. G. Tsitilou, and K. Iatrou, in "Gene 
Amplification and Analysis," Vol. 2: "Structural Analysis of Nucleic Acids (J. G. 
Chirikjian and T. P. Papas, eds.), p. 537. Elsevier/North-Holland, Amsterdam, 1981. 



onto a lawn of bacteria. DNAs from the resulting plaques were transferred to nitrocellulose 
and hybridized to: {a) nick-translated pc401 , at 65°, 0.6 M NaCl; (b) a nick-translated i Kpn I 
fragment of pc401 representing the 3' half of the mRNA, at 80°, 0.3MNaCUn each case 
hybridized filters were washed at the hybridization temperature in 0.3 M NaCl and 0.15 M 
NaCl APcllO (a genomic clone containing two copies of 401) and APcl73 (a genomic clone 
containing two copies of the distantly related, B-family chorion gene 10) were used as 
positive and negative controls. 
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equal amounts and in dots of uniform diameter. As a result, after hybrid- 
ization the intensity of the various dots can be estimated semiquanti- 
tatively, by visual comparison to standards consisting of a dilution series 
of radioactive DNA directly spotted on a similar filter. 

Figure 4 shows examples of dot-blot hybridizations. 22 The panel la- 
beled "Standard" is a 2-fold dilution series that spans a 64-fold range of 
radioactivity. The panel on the right exemplifies the discrimination that 
can be achieved between homologs by judicious choice of conditions. The 
experiment was performed in 50% formamide, and pc401 cRNA was used 
as the probe. At 40°, all seven members of the B family included in this 
filter hybridized, albeit to different extents; five clones of the A family 
were also present in the filter, but did not detectably hybridize. At 50° and 
64° two dots hybridized intensely and almost equally; one was pc401 itself 
(top) and the other pc602, which differs from pc401 by less than 1% 
mismatching. Weak hybridization with pclO was observed at 50° (about 
10-fold less than the self-hybrid), but none at 64°. 

The remainder of Fig. 4 exemplifies a different, widely applicable use 
of dot-blot hybridization: assaying the concentrations of various se- 
quences in a series of samples. Such assays are enormously more conven- 
ient and more sensitive than assays by liquid hybridization and, by careful 
attention to the conditions, are only slightly less accurate. The example 
shown permitted classification of 12 chorion cDNA clones into "early, 
"middle," and "late" developmental classes, depending on their prefer- 
ential hybridization to mRNA preparations from the corresponding devel- 
opmental stages (lb, V, and Xa, respectively). In this case, the cloned 
DNAs were fixed to the filter and the labeled probes were mRNAs con- 
taining these sequences in unknown amounts. It is also possible to attach 
the unknown samples (either DNA or RNA) to the filter and probe them 
with a specific cloned sequence; examples of this approach are given by 



Fig 4 Discrimination of homologous sequences by dot hybridization." Top left: Chang- 
ing concentrations of specific chorion RNA sequences during d ^ m ^ ^ { ^ s 
tahiing cytoplasmic RNA from Antheraea polyphemus follicles at stages lb. V, or J& was 
isolated/end-labeled with »P after alkali treatment, and hybridized to replicate filters con- 
taining equal amounts of spotted DNA from 12 different chorion cDNA clones. Hybndiza- 
tions were performed in 50% formamide, 0.6 M NaCl for 28 hr (59°) or 40 hr (64 ). As a 
ontrol VoZr left) one filter was hybridized with »P labeled pML-21 DNA thevee^ 
in cloning. Right: Cross-hybridization of chorion cDNAs. DNAs from 12 chorion cDN A 
clones (7 from the B family and 5 from the A family) were again spotted on nitrocellulose, 
^-labeled 401 cRNA was prepared using E. coli RNA polymerase and the . nuclease S 
excised insert of P c401 . Hybridizations were performed in 50% formamide. 0^6 M NaCl at 
the temperatures indicated. Standard: A two-fold dilution series of [ 3 -P]DNA spotted di- 
rectly on nitrocellulose. 
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Weisbrod and Weintraub. 26 In the case of multigene families, it is impor- 
tant to distinguish the concentrations of identical vs related sequences. In 
Fig. 4, this was accomplished by performing the experiment at both 59° 
and 64°. 

Exactly the same approach can be used to characterize members of 
the multigene families by hybrid-selected translation 6 : one hybridizes 
DNA dots under discriminating conditions with "probe" consisting of 
unlabeled mRNA and then melts the hybrids and translates the RNA in a 
cell-free system by standard procedures. If hybridization is performed 
under conditions that permit formation of some cross-hybrids, this tech- 
nique can be used to detect the degree of homology between protein 
species for which sequence information is not available. 27 28 

An apparatus is now available from Bethesda Research Laboratones 
and Schleicher & Schuell for performing dot blots with multiple samples 
stored in Microtiter plates. Dot blots can also be performed manually, and 
we recommend the following procedure. 25 

Preparation of Filters. All filters are washed in water for 1 hr. A plain 
nitrocellulose filter (22 mm in diameter) is mounted on top of two nitrocel- 
lulose filters with grids, on a sintered-glass platform connected to a water 
aspirator. The filters are washed 3-4 times with 1 M ammonium acetate 
before spotting with DNA. 

Plasmid DNA is linearized by restriction endonuclease treatment and 
digested with proteinase K (200 /ng/ml in 50 mM Tris-HCl, pH 7.5; at 37°, 
30 min each with 0.2% and 2% SDS). After phenol extraction the DNA is 
denatured in 0.3 N NaOH for 10 min and chilled; when needed, it is 
diluted with an equal volume of cold 2 M ammonium acetate to a concen- 
tration of 1 .4 /xg/ml. It is then taken up in a capillary pipette attached to a 
micropipette filler (Clay Adams suction apparatus No. 4555), and is spot- 
ted on the filter under light vacuum; once the pipette touches the filter, 
contact is maintained continuously, while the DNA solution is delivered 
slowly with a combination of vacuum suction and positive pressure from 
the filler. We routinely spot 0.7 /tg of DNA in 50 fil per dot, using a 100-/xl 
micropipette. The dot is washed with a drop of 1 M ammonium acetate, as 
is the area to be spotted next. After all the dots are made, the filter is 
washed again with 1 M ammonium acetate under suction, air dried, 
treated with 2x Denhardt's solution for 1 hr, drained, air-dried, and 
baked at 80° for 2 hr. 

Prehybridization. The dried filters are wetted evenly by slow immer- 
sion in 10 x Denhardt's-4x SET buffer and are shaken in that solution for 

26 S. Weisbrod and H. Weintraub, Cell 23, 391 (1981). 

27 N. K. Moschonas, Ph.D. Thesis, Univ. of Athens, Athens, Greece, 1980. 

28 G. Thireos and F. C. Kafatos, Dev. Biol. 78, 36 (1980). 
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at least 1 hr They are transferred to sterile, siliconized scintillation vials 
containing blank hybridization mixture, e.g., 50% deionized formamidc, 
2 x Denhardt's solution, 4x SET, 0.1% SDS, 100 pg of yeast tRNA and 
125 i*g of poly(A) per milliliter, and are incubated for at least 1 hr at the 
hybridization temperature. , 8 . • 

Hybridization and Washing. The filters are hybridized for 16-48 hr in 
hybridization mixture (prehybridization mixture supplemented with ra- 
dioactive probe). The temperature is selected to give the desired criterion 
of stringency. Washes are performed at the same temperature twice each 
with 4^, 2x, ix, 0.4X, and 0.2x SET, all with 0.1% SDS. Two final 
washes are performed at room temperature, with 0.1 x SET without SDS. 
The filters are covered with Saran wrap and autoradiographed, either dry 
or moist (if melts are to be undertaken). 

Discrimination by Southern and Northern Analysis 

Discrimination between members of a multigene family, by careful 
control of hybridization conditions and selection of appropriately specific 
probes, can also be accomplished in Southern and Northern transfer ex- 
periments. While dot hybridization is preferable for quantitation and is 
somewhat more convenient, Northern analyses permit detection of the 
sizes of the hybridizing transcripts (which may vary during development), 
and Southern experiments using genomic DNA clones can identify re- 
striction fragments that contain the cross-hybridizing sequences^ Thus, 
these three filter hybridization procedures are complementary, and all are 
apt to be used in the study of a multigene family. 

Figure 5 shows an example of discrimination by low- and high-strin- 
gency Southerns. The probe in this case was the high-cysteine cDN A 
clone m2574 of Bombyx mori? and the filter-bound DNAs were derived 
from 18 overlapping genomic clones (Bl to B18) totaling 270 kb of contin- 
uous DNA spanning the high-cysteine region of the chorion locus. Fit- 
teen m2574-like genes were detected by low-stringency hybndiza ion 
(75° 0 6 M NaCl). High-stringency hybridization permitted identification 
of gene 12 as the one from which m2574 was derived, genes 14 and 15 as 
very close homologs, genes 8 and 10 as moderately close homologs, and 
the remaining 10 genes as more distant homologs. In this case, high strin- 
gency was accomplished by high temperature (85°, 0.3 M NaCl) and by 
the use of a relatively high concentration of DNase during nick-translation 
of the probe. The high DNase approach is recommended for maximizing 



» K. Iatrou, S. G. Tsitilou, and F. C. Kafatos, J. Mol.Biol. 157, 417 (1982). 
»T. H. Eickbush and F. C. Kafatos. Cell 29, 633 (1982). 
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discrimination when sequence information is not available, and therefore 
specific probes cannot be tailored as in Fig. 3b. High DNase results in 
short probe lengths, and, as a result, if mismatching is nonrandomly dis- 
tributed, only the short fragments that might contain a conserved se- 
quence will cross-hybridize at stringent conditions. By contrast, with long 
probes, signals from hybrids that are stable because of a short conserved 
sequence will be enhanced by the presence of covalently attached un- 
paired tails, resulting in decreased discrimination. 

In conclusion, deliberate control of the stringency of hybridization is 
possible in all filter hybridization procedures and is a powerful tool in the 
isolation and characterization of multigene families. 
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Tails to Allow Cloning of 5'-Terminal mRNA Sequences 
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Cloning of mRNA sequences after reverse transcription into cDNA 
copies 1 " 3 plays an important role in the analysis of gene structure and 
function. A variety of methods for cloning cDNAs in bacterial plasmids 
has been described.^ 10 In the method most commonly used, 7 "* the ability 

> I. Verma, G. F. Temple, H. Fan, and D. Baltimore, Nature {London) New Biol. 235, 163 

*D 97 l! Kacian, S. Spiegelman, A. Bank, M. Terada, S. Metafora, L. Dow, and P. A. 

Marks, Nature {London) New Biol. 235, 167 (1972). n<5721 
3 J. Ross, H. Aviv, E. Scolnick, and P. Leder, Proc. Natl. Acad. Sc. U.S.A. 69, 264 (1972). 

* F. Rougeon, P. Kourilsky, and B. Mach, Nucleic Acids Res. 2, 2365 (1975). 
3 T. H. Rabbits, Nature {London) 260, 221 (1976). 

* K. 0. Wood and J. C. Lee, Nucleic Acids Res. 3, 1961 (1976). 

■> T. Maniatis, S. G. Kee, A. Efstratiadis, and F. C. Kafatos, Cell 9, 163 (1976). 

8 R. Higuchi, G. V. Paddock, R. Wall, and W. Salser, Proc. Natl. Acad. Set. U.S.A. 73, 

» F. Rouglon and B. Mach, Proc. Natl. Acad. Sci. U.S.A. 73, 3418 (1976). 
>• S. Zain, J. Sambrook, R. J. Roberts, W. Keller, M. Fried, and A. R. Dunn, Cell 16, 851 
(1979). 

Copyright ® 1983 by Academic Press. Inc. 
METHODS IN ENZYMOLOGY. VOL. ,00 A.I rights of reproduction i» -.y^ 



